
PATENT 
Docket No. 285.00210101 



COMPUTER-BASED MULTI-MEDIA 
COMMUNICATIONS SYSTEM AND METHOD 



Reference to Related Applications 



This application is a continuation of U.S. Patent Application Serial No. 
09/400,607 filed 20 September 1999 (pending) which is a contmuation of U.S. 
Patent Application Serial No. 08/338,340 filed 10 November 1994 (issued as 
U.S. Patent No. 6,009,082) which is a continuation in part of U.S. Patent 
Application No. 08/002,467 filed 8 January 1993 (issued as U.S. Patent No. 
5,452,289), wherein the complete applications of which, including any 
microfiche appendix, are incorporated herein by reference. 

Field of the Invention 

The present invention relates to communications systems and, in 
particular, to computer-assisted multi-media communications. 



A wide variety of commxmications alternatives are currently available to 
telecommunications users. For example, facsimile transmission of printed 
matter is available through what is commonly referred to as a stand-alone fax 
machine. Alternatively, fax-modem communication systems are currently 
available for personal computer users which combine the operation of a 
facsimile machine with the word processor of a computer to transmit documents 
held on computer disk. Modem commimication over telephone lines in 
combination with a personal computer is also known in the art, where file 
transfers can be accomplished from one computer to another. Also, 
simultaneous voice and modem data transmitted over the same telephone line 
has been accomplished in several ways. 

There is a need in the art, however, for a personal commimications 
system which combines a wide variety of communication functions into an 
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integrated hardware-software product such that the user can conveniently choose 
a mode of communication and have that communication automatically invoked 
from a menu-driven selection system. 

5 Summary of the Invention 

The present disclosure describes a complex computer-assisted 
conununications system which contains multiple inventions. The subject of the 
present multiple inventions is a personal communications system which includes 
components of software and hardware operating in conjimction with a personal 

10 computer. The user interface control software operates on a personal computer, 
preferably within the Microsoft Windows® environment. The software control 
system communicates with hardware components linked to the software through 
the personal computer serial commimications port. The hardware components 
include telephone communication equipment, digital signal processors, and 

1 5 hardware to enable both fax and data communication with hardware components 
at a remote site connected through a standard telephone line. The fimctions of 
the hardware components are controlled by control software operating within the 
hardware components and from the software components operating within the 
personal computer. 

20 Communications between the software components running on the 

personal computer and the local hardware components over the serial 
commimications link is by a special packet protocol for digital data 
communications. This bi-directional communications protocol allows 
uninterrupted bi-directional fiUl-duplex transfer of both control information and 

25 data conMiunication. 

The major functions of the present system are a telephone function, a 
voice mail fimction, a fax manager fimction, a multi-media mail fimction, a 
show-and-tell fimction, a terminal function, and an address book fimction. The 
telephone function allows the present system to operate, from the user's 

30 perspective, as a conventional telephone using either hands-free, headset, or 

handset operation. The telephone fimction is more sophisticated than a standard 
telephone in that the present system converts the voice into a digital signal which 
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can be processed with echo cancellation, compressed, stored as digital data for 
later retrieval, and transmitted as digital voice data concurrent with the transfer 
of digital information data. 

The voice mail function of the present system operates as a telephone 
5 answering machine which can receive, compress, and store voice messages for 
later retrieval or reuse in response messaging. 

The fax manager function of the present system allows the transmission 
and reception of facsimile information. The software component of the present 
system operates in conjunction with other commercially-available software 
10 programs, such as word processors and the like, to transmit and receive facsimile 
pages of digital data stored on a computer system. 

The multi-media mail component of the present system allows the 
operator to create documents that include text, graphics, and voice mail 
messages which can be sent as a combined package over conventional telephone 
15 lines for receipt at a like-configured site using the present system. 

The show-and-tell component of the present system enables the operator 
to simultaneously transmit voice and data commxmication to a remote site. This 
voice over data function dynamically allocates data bandwidth over the 
telephone line depending on the demands of the voice grade digitized signal. 
20 The terminal feature of the present system allows the user to establish a 

data commxmications session wdth another computer system allowing the user's 
local computer system to operate as a dumb terminal. 

The address book function of the present system is a versatile database 
that is built by the user and operates in conjvmction with the other components of 
25 the present system to dial and establish communication links with remote sites to 
enable data communication, voice mail, facsimile, and file transfer all in an 
automated mode without user intervention. 

The hardware components of the present system include circuitry to 
enable digital data communication and facsimile communication over standard 
30 telephone lines. The hardware components also include circuitry to convert the 
voice to digital data and compress that data for transfer to the software 
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component on the personal computer or transfer it over the telephone lines to a 
remote site. 

Many of the functions of the present system are accomplished by 
including a voice control digital signal processor (DSP) to operate in conjunction 
5 with a data/fax modem implemented with a data pump DSP. The data pump 
DSP and the voice control DSP accomplish the following functions in an 
integrated hardware arrangement. 

A sophisticated telephone apparatus with its attached handset, headset, 
and a built-in hands-free telephone operation using the integrated microphone 
10 and speaker system. The hands-free telephone works in full-duplex mode 

through the use of voice echo cancellation performed by the voice control DSP. 

The voice control DSP, in conjunction with a telephone CODEC, 
provides voice compression which can be sent to the computer system that is 
attached to the RS232 port for storage and later retrieval. The compressed voice 
15 from the voice control DSP can also be multiplexed with the input data stream 
from the personal computer with dynamic time allocation. Whereas, the input 
data from the attached computer is transmitted using the error control protocol 
like MNP or V.42 with or without data compression (e.g., V.42bis), the speech is 
packetized using a different header defining it as a speech packet and then 
20 transmitted through a controller. The speech packets, like the data packets, have 
the attached CRC codes. However, the speech packets are not sequenced, and 
the like hardware at the receiving end ignores the accompanying CRC codes for 
voice packets and passes the voice packets to the voice control DSP for 
decompression. The decompressed speech is played through one of the 
25 telephone receiving units, i.e., the headset, handset, or the built-in speaker. 

The voice control DSP allows the compressed speech to be recorded on a 
recording media, e.g., the hard disk drive of the attached computer system. This 
provides the function of an answering machine. In addition to the answering 
machine function, the recorded speech can be provided for the voice mail 
30 functions. 

The special packet protocol over the RS232 interface between the 
software component and the hardware component that governs the operation of 
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the hardware component is so designed that it allows various control functions to 
be intermixed with data over the RS232 serial port. The software component of 
the present system accepts the generic AT modem commands when not in the 
special packet mode. When the hardware component is configured to accept the 
5 packet level protocol over the RS232 port, it can be made to switch to the 
generic command mode through the use of a break sequence. 

The hardware components of the present system function as a data/fax 
modem when the speech compression or telephone mode is not invoked. The 
packet mode or the generic AT command mode may be used for this purpose. 

1 0 The hardware components of the present system incorporate a provision 

for a special link integrity packet to facilitate the device to work over cellular 
networks. This scheme allows the modem in one of its plurality of modes to 
ignore the carrier drop-outs (selective fading) inherent in the cellular networks. 
Such a scheme does not use carrier detect circuitry of the modem. The 

1 5 disconnect of the cellular connection is done through a negotiation scheme using 
packet interchange between the two ends of the link. 

In cellular networks, the multiplexed voice data technology of the present 
system allows a single apparatus to function as a smart telephone, an intelligent 
data modem, as well as a fax modem. These features, along with the voice data 

20 multiplex mode, provide a traveling user complete fi-eedom to use his or her 
moving vehicle as a true traveling office. 

These features of the hardware component of the present system, along 
with the features of the software component of the present system ruiming on a 
PC, provides a user with a complete range of telecommunications functions of a 

25 modem office, be it stationary or mobile. 



Description of the Drawing s 

Li the drawings, where like numerals describe like components 
throughout the several views, 
30 Figure 1 shows the telecommunications environment within which the 

present system may operate in several of the possible modes of communication; 
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Figure 2 is the main menu icon for the software components operating on 
the personal computer; 

Figure 3 is a block diagram of the hardware components of the present 

system; 

5 Figure 4 is a key for viewing the detailed electrical schematic diagrams of 

Figures 5 A- IOC to facilitate imderstanding of the interconnect between the 
drawings; 

Figures 5A-5C, 6A-6C, 7A-7C, 8A-8B, 9A-9C, and lOA-lOC are 
detailed electrical schematic diagrams of the circuitry of the hardware 
1 0 components of the present system; 

Figure 1 1 is a signal flow diagram of the speech compression algorithm; 

Figures 12 is a detailed function flow diagram of the speech compression 
algorithm; 

Figure 13 is a detailed function flow diagram of the speech 
1 5 decompression algorithm; 

Figure 14 is a detailed function flow diagram of the echo cancellation 
algorithm; 

Figure 1 5 is a detailed function flow diagram of the voice/data 
multiplexing function; 
20 Figure 16 is a perspective view of the components of a digital computer 

compatible with the present invention; 

Figure 17 is a block diagram of the software structure compatible with 
the present invention; 

Figure 1 8 is a block diagram of the control structure of software 
25 compatible with the present invention; 

Figure 19 is a block diagram of the main menu structure of software 
compatible with the present invention; 

Figure 20 is a flow diagram of answer mode software compatible with 
the present invention; 
30 Figure 21 is a flow diagram of telephone software compatible with the 

present invention; 
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Figure 22 is a flow diagram of voice mail software compatible with the 
present invention; 

Figure 23 is a flow diagram of fax manager software compatible with the 
present invention; 

5 Figure 24 is a flow diagram of multi-media mail software compatible 

with the present invention; 

Figure 25 is a flow diagram of a timing loop compatible with the present 
invention; 

Figure 26 is a flow diagram of telephone control software compatible 
1 0 with the present invention; 

Figure 27 is a flow diagram of voice mail control software compatible 
with the present invention; 

Figure 28 is a flow diagram of high resolution fax driver software 
compatible with the present invention; 
1 5 Figure 29 is a flow diagram of low resolution fax driver software 

compatible with the present invention; 

Figure 30 is a flow diagram of multi-media mail control software 
compatible with the present invention; 

Figure 3 1 is a flow diagram of multi-media mail editor software 
20 compatible with the present invention; 

Figure 32 is a flow diagram of multi-medial mail transmit software 
compatible with the present invention; 

Figure 33 is a flow diagram of multi-medial mail receive software 
compatible with the present invention; 
25 Figure 34 is a flow diagram of show-and-tell transmit software 

compatible with the present invention; 

Figure 35 is a flow diagram of show-and-tell receive software compatible 
with the present invention; 

Figure 36 is a flow diagram of voice mail transmit software compatible 
30 with the present invention; 

Figure 37 is a flow diagram of voice mail receive software compatible 
with the present invention; 
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Figure 38 is a flow diagram of an outgoing timer loop compatible with 
the present invention; 

Figure 39 is a flow diagram of an incoming timer loop compatible with 
the present invention; 
5 Figure 40 is an initialization screen display compatible with the present 

invention; 

Figure 41 is a communication port setup screen display compatible with 
the present invention; 

Figure 42 is an answer mode setup screen display compatible with the 
10 present invention; 

Figure 43 is a hold call setup screen display compatible with the present 
invention; 

Figure 44 is a voice mail setup screen display compatible with the present 
invention; 

15 Figure 45 is a PBX setup screen display compatible with the present 

invention; 

Figure 46 is a fax setup screen display compatible with the present 
invention; 

Figure 47 is a multi-media mail set-up screen display compatible with the 
20 present invention; 

Figure 48 is a show-and-tell set-up screen display compatible with the 
present invention; 

Figure 49 is a telephone control screen display compatible with the 
present invention; 

25 Figure 50 is a voice mail control screen display compatible with the 

present invention; 

Figure 5 1 is a voice editor screen display compatible with the present 
invention; 

Figure 52 is a fax manager control screen display compatible with the 
30 present invention; 

Figure 53 is a multi-media mail control screen display compatible with 
the present invention; 
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Figure 54 is a show-and-tell control screen display compatible with the 
present invention; 

Figure 55 is an address book control screen display compatible with the 
present invention; 

5 Figure 56 is a voice message destination screen display compatible with 

the present invention; and 

Figure 57 is a message composer screen display compatible with the 
present invention. 

10 Detailed Description of the Preferred Embodiments 

The specifications for the multiple inventions described herein includes 
the present description, the drawings, and a microfiche appendix. In the 
following detailed description of the preferred embodiment, reference is made to 
the accompanying drawings which form a part hereof, and in which is shown by 

1 5 way of illustration specific embodiments in which the inventions may be 

practiced. These embodiments are described in sufficient detail to enable those 
skilled in the art to practice the invention, and it is to be understood that other 
embodiments may be utilized and that structural changes may be made without 
departing fi*om the spirit and scope of the present inventions. The foUov^g 

20 detailed description is, therefore, not to be taken in a limiting sense, and the 
scope of the present inventions is defined by the appended claims. 

Figure 1 shows a typical arrangement for the use of the present system. 
Personal computer 10 is running the software components of the present system 
while the hardware components 20 include the data commvmication equipment 

25 and telephone headset. Hardware components 20 commimicate over a standard 
telephone line 30 to one of a variety of remote sites. One of the remote sites may 
be equipped v^th the present system, including hardware components 20a and 
software components running on personal computer 10a. In one altemative use, 
the local hardware components 20 may be communicating over standard 

30 telephone line 30 to facsimile machine 60. In another altemative use, the present 
system may be communicating over a standard telephone line 30 to another 
personal computer 80 through a remote modem 70. In another altemative use. 
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the present system may be communicating over a standard telephone line 30 to a 
standard telephone 90. Those skilled in the art will readily recognize the wide 
variety of conmiimication interconnections possible with the present system by 
reading and understanding the following detailed description. 

The ornamental features of the hardware components 20 of Figure 1 are 
claimed as part of Design Patent Application Nxmiber 29/001368, filed 
November 12, 1992, entitled, "Telephone/Modem case for a Computer-Based 
Multifunction Personal Commimications System," assigned to the same assignee 
of the present inventions and hereby incorporated by reference. 



General Overview 

The present inventions are embodied in a commercial product by the 
assignee, Multi-Tech Systems, Inc. The software component operating on a 
personal computer is sold under the commercial trademark of MultiExpress 

1 5 PCS™ personal commimications software, while the hardware component of the 
present system is sold imder the commercial name of MultiModemPCS™, 
Intelligent Personal Commimications System Modem. In the preferred 
embodiment, the software component runs under Microsoft® Windows®, 
however, those skilled in the art will readily recognize that the present system is 

20 easily adaptable to run under any single- or multi-user, single- or multi-window 
operating system. 

The present system is a multi-function communication system which 
includes hardware and software components. The system allows the user to 
connect to remote locations equipped with a similar system or with modems, 

25 facsimile machines, or standard telephones over a single analog telephone line. 
The software component of the present system includes a number of modules 
which are described in more detail below. 

Figure 2 is an example of the Windows®-based main menu icon of the 
present system operating on a personal computer. The functions listed with the 

30 icons used to invoke those functions are shown in the preferred embodiment. 
Those skilled in the art will readily recognize that a wide variety of selection 
techniques may be used to invoke the various functions of the present system. 



10 




The icon of Figure 2 is part of Design Patent Application Number 29/001397, 
filed November 12, 1992, entitled, "Icons for a Computer-Based Multifunction 
Personal Commimications System," assigned to the same assignee of the present 
inventions and hereby incorporated by reference. 
5 The telephone module allows the system to operate as a conventional or 

sophisticated telephone system. The system converts voice into a digital signal 
so that it can be transmitted or stored with other digital data, like computer 
information. The telephone function supports PBX and Centrex features such as 
call waiting, call forwarding, caller ID, and three-way calling. This module also 

10 allows the user to mute, hold, or record a conversation. The telephone module 
enables the handset, headset, or hands-fi"ee speaker telephone operation of the 
hardware component. It includes on-screen pushbutton dialing, speed-dial of 
stored numbers, and digital recording of two-way conversations. 

The voice mail portion of the present system allows this system to 

15 operate as a telephone answering machine by storing voice messages as digitized 
voice files along with a time/date voice stamp. The digitized voice files can be 
saved and sent to one or more destinations immediately or at a later time using a 
queue scheduler. The user can also listen to, forward, or edit the voice messages 
which have been received with a powerful digital voice editing component of the 

20 present system. This module also creates queues for outgoing messages to be 

sent at pre-selected times and allows the users to create outgoing messages with 
the voice editor. 

The fax manager portion of the present system is a queue for incoming 
and outgoing facsimile pages. In the preferred embodiment of the present 

25 system, this function is tied into the Windows "print" conmiand once the present 
system has been installed. This feature allows the user to create faxes from any 
Windows®-based docxmient that uses the "print" command. The fax manager 
function of the present system allows the user to view queued faxes which are to 
be sent or which have been received. This module creates queues for outgoing 

30 faxes to be sent at pre-selected times and logs incoming faxes with time/date 
stamps. 
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The mixlti-media mail function of the present system is a utility which 
allows the user to compose documents that include text, graphics, and voice 
messages using the message composer function of the present system, described 
more fully below. The multi-media mail utility of the present system allows the 
5 user to schedule messages for transmittal and queues up the messages that have 
been received so that they can be viewed at a later time. 

The show-and-tell function of the present system allows the user to 
establish a data over voice (DOV) commxmications session. When the user is 
transmitting data to a remote location similarly equipped, the user is able to talk 

10 to the person over the telephone line while concurrently transferring the data. 

This voice over data function is accomplished in the hardware components of the 
present system. It digitizes the voice and transmits it in a dynamically changing 
allocation of voice data and digital data multiplexed in the same transmission. 
The allocation at a given moment is selected depending on amount of voice 

15 digital information required to be transferred. Quiet voice intervals allocate 
greater space to the digital data transmission. 

The terminal function of the present system allows the user to establish a 
data conmiunications session with another computer which is equipped with a 
modem but which is not equipped with the present system. This feature of the 

20 present system is a Windows®-based data communications program that reduces 
the need for issuing "AT" commands by providing menu-driven and "pop-up" 
window alternatives. 

The address book function of the present system is a database that is 
accessible from all the other functions of the present system. This database is 

25 created by the user inputting destination addresses and telephone numbers for 
data communication, voice mail, facsimile transmission, modem 
communication, and the like. The address book function of the present system 
may be utilized to broadcast communications to a wide variety of recipients. 
Multiple linked databases have separate address books for different groups, and 

30 different destinations may be created by the users. The address book fimction 
includes a textual search capability which allows fast and efficient location of 
specific addresses, as described more fully below. 
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Multiplexor (MUX) circuit 310 selects between the voice control DSP 
circuit 306 and the data pump DSP circuit 311 for transmission of information 
on the telephone line through telephone line interface circuit 309. 

The data pump circuit 311 also includes a digital signal processor (DSP) 
5 and a CODEC for communicating over the telephone line interface 309 through 
MUX circviit 310. The data pump DSP and CODEC of circuit 311 performs 
functions such as modulation, demodulation, and echo cancellation to 
communicate over the telephone line interface 309 using a plurality of 
telecommunications standards, including FAX and modem protocols. 

1 0 The main controller circuit 313 controls the DSP data pump circuit 311 

and the voice control DSP circuit 306 through serial input/output and clock timer 
control (SIO/CTC) circuits 312 and dual port RAM circuit 308, respectively. 
The main controller circuit 313 communicates with the voice control DSP 306 
through dual port RAM circuit 308. In this fashion, digital voice data can be 

15 read and written simultaneously to the memory portions of circuit 308 for high 

speed commvinication between the user (through interfaces 301, 302, or 303/304) 
and the personal computer connected to serial interface circuit 315 and the 
remote telephone connection connected through the telephone line attached to 
line interface circuit 309. 

20 As described more fully below, the main controller circuit 313 includes, 

in the preferred embodiment, a microprocessor which controls the functions and 
operation of all of the hardware components shown in Figure 3. The main 
controller is connected to RAM circuit 316 and a programmable and electrically 
erasable read-only memory (PEROM) circuit 317. The PEROM circuit 317 

25 includes non- volatile memory in which the executable control programs for the 
voice control DSP circuits 306 and the main controller circuits 313 operate. 

The RS232 serial interface circuit 315 communicates to the serial port of 
the personal computer which is running the software components of the present 
system. The RS232 serial interface circuit 315 is connected to a serial 

30 input/output circuit 314 with main controller circuit 313. SIO circuit 3 14 is in 
the preferred embodiment, a part of SIO/CTC circuit 312. 
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the telephone interface is used to send the analog speech patterns to the digital 
telephone CODEC circuit 305. Circuit 305 digitizes the voice patterns and 
passes them to voice control circuit 306 where the digitized voice patterns are 
digitally compressed. The digitized and compressed voice patterns are passed 
through dual port RAM circuit 308 to the main controller circuit 313 where they 
are transferred through the serial interface to the personal computer using a 
packet protocol defined below. The voice patterns are then stored on the disk of 
the personal computer for later use in multi-media mail, for voice mail, as a pre- 
recorded answering machine message, or for later pre-determined transmission 
to other sites. 

For the present system to operate as an answering machine, the hardware 
components of Figure 3 are placed in answer mode. An incoming telephone ring 
is detected through the telephone line interface circuit 309, and the main 
controller circuit 313 is alerted which passes the information off to the personal 
computer through the RS232 serial interface circuit 315. The telephone line 
interface circuit 309 seizes the telephone line to make the telephone connection. 
A pre-recorded message may be sent by the personal computer as compressed 
and digitized speech through the RS232 interface to the main controller circuit 
313. The compressed and digitized speech from the personal computer is passed 
from main controller circuit 313 through dual port RAM circuit 308 to the voice 
control DSP circuit 306 where it is uncompressed and converted to analog voice 
pattems. These analog voice patterns are passed through multiplexor circuit 310 
to the telephone line interface 309 for transmission to the caller. Such a message 
may invite the caller to leave a voice message at the sound of a tone. The 
incoming voice messages are received through telephone line interface 309 and 
passed to voice control circuit 306. The analog voice pattems are digitized by 
the integral CODEC of voice control circuit 306 and the digitized voice pattems 
are compressed by the voice control DSP of the voice control circuit 306. The 
digitized and compressed speech pattems are passed through dual port RAM 
circuit 308 to the main controller circuit 313, where they are transferred using 
packet protocol described below through the RS232 serial interface 315 to the 
personal computer for storage and later retrieval. In this fashion, the hardware 
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components of Figure 3 operate as a transmit and receive voice mail system for 
implementing the voice mail function 1 17 of the present system. 

The hardware components of Figure 3 may also operate to facilitate the 
fax manager function 119 of Figure 2. In fax receive mode, an incoming 
5 telephone call will be detected by a ring detect circuit of the telephone line 

interface 309 which will alert the main controller circuit 3 13 to the incoming 
call. Main controller circuit 313 will cause line interface circuit 309 to seize the 
telephone line to receive the call. Main controller circuit 313 will also 
concurrently alert the operating programs on the personal computer through the 

1 0 RS232 interface using the packet protocol described below. Once the telephone 
line interface seizes the telephone line, a fax carrier tone is transmitted and a 
return tone and handshake is received from the telephone line and detected by 
the data pump circuit 311. The reciprocal transmit and receipt of the fax tones 
indicates the imminent receipt of a facsimile transmission, and the main 

1 5 controller circuit 313 configures the hardware components of Figure 3 for the 
receipt of that information. The necessary handshaking with the remote 
facsimile machine is accomplished through the data pump 311 under control of 
the main controller circuit 313. The incoming data packets of digital facsimile 
data are received over the telephone line interface and passed through data pump 

20 circuit 3 1 1 to main controller circuit 313, which forwards the information on a 
packet basis (using the packet protocol described more fully below) through the 
serial interface circuit 3 15 to the personal computer for storage on disk. Those 
skilled in the art will readily recognize that the fax data could be transferred from 
the telephone line to the personal computer using the same path as the packet 

25 transfer except using the normal AT stream mode. Thus, the incoming facsimile 
is automatically received and stored on the personal computer through the 
hardware components of Figure 3. 

A facsimile transmission is also facilitated by the hardware components 
of Figure 3. The transmission of a facsimile may be immediate or queued for 

30 later transmission at a pre-determined or pre-selected time. Control packet 

information to configure the hardware components to send a facsimile are sent 
over the RS232 serial interface between the personal computer and the hardware 



17 




components of Figure 3 and are received by main controller circuit 313. The 
data pump circuit 311 then dials the recipient's telephone number using DTMF 
tones or pulse dialing over the telephone line interface circuit 309. Once an 
appropriate connection is established with the remote facsimile machine, 
5 standard facsimile handshaking is accomplished by the data pump circuit 311. 
Once the facsimile connection is established, the digital facsimile picture 
information is received through the data packet protocol transfer over serial line 
interface circuit 315, passed through main controller circuit 313 and data pump 
circuit 311 onto the telephone line through telephone line interface circuit 309 

1 0 for receipt by the remote facsimile machine. 

The operation of the multi-media mail function 121 of Figure 2 is also 
facilitated by the hardware components of Figure 3. A multi-media transmission 
consists of a combination of picture information, digital data, and digitized voice 
information. For example, the type of multi-media information transferred to a 

15 remote site using the hardware components of Figure 3 could be the multi-media 
format of the Microsoft® Multimedia Wave® format with the aid of an Intelligent 
Serial Interface (ISI) card added to the personal computer. The multi-media may 
also be the type of multi-media information assembled by the software 
component of the present system, which is described more fully below. 

20 The multi-media package of information including text, graphics, and 

voice messages (collectively called the multi-media docxmient) may be 
transmitted or received through the hardware components shown in Figure 3. 
For example, the transmission of a multi-media document through the hardware 
components of Figure 3 is accomplished by transferring the multi-media digital 

25 information using the packet protocol described below over the RS232 serial 

interface between the personal computer and the serial line interface circuit 315. 
The packets are then transferred through main controller circuit 313 through the 
data pxmip circuit 311 on to the telephone line for receipt at a remote site through 
telephone line interface circuit 309. In a similar fashion, the multi-media 

30 docimients received over the telephone line from the remote site are received at 
the telephone line interface circuit 309, and passed through the data pump circuit 
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311 for receipt and forwarding by the main controller circuit 313 over the serial 
line interface circuit 315. 

The show-and-tell function 123 of the present system allows the user to 
establish a data over voice commxmication session. In this mode of operation, 
5 full-duplex data transmission may be accomplished simultaneously with the 

voice communication between both sites. This mode of operation assvmies a like 
configured remote site. The hardware components of the present system also 
include a means for sending voice/data over cellular links. The protocol used for 
transmitting multiplexed voice and data include a supervisory packet described 

10 more fully below to keep the link established through the cellular link. This 
supervisory packet is an acknowledgment that the link is still up. The 
supervisory packet may also contain link information to be used for adjusting 
various link parameters when needed. This supervisory packet is sent every 
second when data is not being sent, and if the packet is not acknowledged after a 

1 5 specified number of attempts, the protocol would then give an indication that the 
cellular link is down and then allow the modem to take action. The action could 
be, for example, change speeds, retrain, or hang up. The use of supervisory 
packets is a novel method of maintaining inherently intermittent cellular links 
when transmitting multiplexed voice and data. 

20 The voice portion of the voice over data transmission of the show-and- 

tell function is accomplished by receiving the user's voice through the telephone 
interface 301, 302, or 303, and the voice information is digitized by the digital 
telephone circuit 305. The digitized voice information is passed to the voice 
control circuit 306, where the digitized voice information is compressed using a 

25 voice compression algorithm described more fully below. The digitized and 
compressed voice information is passed through dual port RAM circuit 308 to 
the main controller circuit 313. During quiet periods of the speech, a quiet flag 
is passed from voice control circuit 306 to the main controller 313 through a 
packet transfer protocol described below by a diml port RAM circuit 308. 

30 Simultaneous with the digitizing compression and packetizing of the 

voice information is the receipt of the packetized digital information from the 
personal computer over interface line circuit 315 by main controller circuit 313. 
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Main controller circuit 313 in the show-and-tell function of the present system 
must efficiently and effectively combine the digitized voice information with the 
digital information for transmission over the telephone line via telephone line 
interface circuit 309. As described above, and as described more fixlly below, 
5 main controller circuit 313 dynamically changes the amoxmt of voice information 
and digital information transmitted at any given period of time, depending upon 
the quiet times during the voice transmissions. For example, during a quiet 
moment where there is no speech information being transmitted, main controller 
circuit 313 ensures that a higher volume of digital data information be 
10 transmitted over the telephone line interface in lieu of digitized voice 
mformation. 

Also, as described more fiiUy below, the packets of digital data 
transmitted over the telephone line interface with the transmission packet 
protocol described below requires 100% accuracy in the transmission of the 

1 5 digital data, but a lesser standard of accuracy for the transmission and receipt of 
the digitized voice information. Since digital information must be transmitted 
with 100% accuracy, a corrupted packet of digital information received at the 
remote site must be retransmitted. A retransmission signal is commxmicated 
back to the local site and the packet of digital information which was corrupted 

20 during transmission is retransmitted. If the packet transmitted contained voice 
data, however, the remote site uses the packets whether they were corrupted or 
not, as long as the packet header was intact. If the header is corrupted, the 
packet is discarded. Thus, the voice information may be corrupted without 
requesting retransmission, since it is understood that the voice information must 

25 be transmitted on a real-time basis and the corruption of any digital information 
of the voice signal is not critical. In contrast to this, the transmission of digital 
data is critical and retransmission of corrupted data packets is requested by the 
remote site. 

The transmission of the digital data follows the CCITT V.42 standard, as 
30 is well known in the industry and as described in the CCITT Blue Book, Volimie 
Vni, entitled Data Cormnunication over the Telephone Network, 1989. The 
CCITT V.42 standard is hereby incorporated by reference. The voice data packet 
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information also follows the CCITT V.42 standard but uses a different header 
format so the receiving site recognizes the difference between a data packet and 
a voice packet. The voice packet is distinguished from a data packet by using 
undefined bits in the header (80 hex) of the V.42 standard. The packet protocol 
5 for voice over data transmission during the show-and-tell function of the present 
system is described more fully below. 

Since the voice over data communication with the remote site is full- 
duplex, incoming data packets and incoming voice packets are received by the 
hardware components of Figure 3. The incoming data packets and voice packets 

10 are received through the telephone line interface circuit 309 and passed to the 
main controller circuit 313 via data pump DSP circuit 311. The incoming data 
packets are passed by the main controller circuit 3 1 3 to the serial interface circuit 
3 1 5 to be passed to the personal computer. The incoming voice packets are 
passed by the main controller circuit 3 1 3 to the dual port RAM circuit 308 for 

15 receipt by the voice control DSP circuit 306. The voice packets are decoded and 
the compressed digital information therein is uncompressed by the voice control 
DSP of circuit 306. The uncompressed digital voice information is passed to 
digital telephone CODEC circuit 305 where it is reconverted to an analog signal 
and retransmitted through the telephone line interface circuits. In this fashion, 

20 full-duplex voice and data transmission and reception is accomplished through 
the hardware components of Figure 3 during the show-and-tell functional 
operation of the present system. 

Terminal operation 125 of the present system is also supported by the 
hardware components of Figure 3. Terminal operation means that the local 

25 personal computer simply operates as a "dumb" terminal including file transfer 
capabilities. Thus, no local processing takes place other than the handshaking 
protocol required for the operation of a dumb terminal. In terminal mode 
operation, the remote site is assumed to be a modem connected to a personal 
computer, but the remote site is not necessarily a site which is configured 

30 according to the present system. In terminal mode of operation, the command 
and data information from the personal computer is transferred over the RS232 
serial interface circuit 315 and forwarded by main controller circuit 3 13 to the 



21 




data pump circxxit 311 where the data is placed on the telephone line via 
telephone line interface circuit 309. 

In a reciprocal fashion, data is received from the telephone line over 
telephone line interface circuit 309 and simply forwarded by the data pump 
5 circuit 311, the main controller circuit 313 over the serial line interface circuit 
315 to the personal computer. 

As described above, and more fully below, the address book function of 
the present system is primarily a support function for providing telephone 
numbers and addresses for the other various functions of the present system. 

10 

Detailed Electrical Schematic Diagrams 

The detailed electrical schematic diagrams comprise Figures 5A-C, 6A- 
C, 7A-C, 8A-B, 9A-C, and lOA-C. Figure 4 shows a key on how the schematic 
diagrams may be conveniently arranged to view the passing of signals on the 

15 electrical lines between the diagrams. The electrical connections between the 
electrical schematic diagrams are through the designators listed next to each 
wire. For example, on the right side of Figure 5 A, address lines A0-A19 are 
attached to an address bus for which the individual electrical lines may appear on 
other pages as A0-A19 or may collectively be connected to other schematic 

20 diagrams through the designator "A" in the circle connected to the collective bus. 
In a like fashion, other electrical lines designated with symbols, such as RNGL 
on the lower left-hand side of Figure 5 A, may connect to other schematic 
diagrams using the same signal designator RNGL. 

Beginning with the electrical schematic diagram of Figure 7C, the 

25 telephone line connection in the preferred embodiment is through connector J2, 
which is a standard 6 -pin modular RJ-1 1 jack. In the schematic diagram of 
Figure 7C, only the tip and ring connections of the first telephone circuit of the 
RJ-1 1 modular connector are used. Ferrite beads FB3 and FB4 are placed on the 
tip and ring wires of the telephone line connections to remove any high 

30 frequency or RF noise on the incoming telephone line. The incoming telephone 
line is also over- voltage protected through SID ACTOR R4. The incoming 
telephone line may be full wave rectified by the full wave bridge comprised of 
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diodes CR27, CR28, CR29, and CR31. Switch S4 switches between direct 
connection and full wave rectified connection depending upon whether the line 
is a non-powered leased line or a standard telephone line. Since a leased line is a 
"dead" line with no voltage, the full wave rectification is not needed. 
5 Also connected across the incoming telephone line is a ring detect circuit. 

Optical isolator U32 (part model number CNY17) senses the ring voltage 
threshold when it exceeds the breakdown voltages on zener diodes CRl and 
CR2. A filtering circuit shown in the upper right comer of Figure 7C creates a 
long RC delay to sense the constant presence of an AC ring voltage and buffers 

10 that signal to be a binary signal out of operational amplifier U25 (part model 

number TL082). Thus, the RNGL and J 1 RING signals are binary signals for use 
in the remaining portions of the electrical schematic diagrams to indicate a 
presence of a ring voltage on the telephone line. 

The present system is also capable of sensing the caller ID information 

1 5 which is transmitted on the telephone line between rings. Between the rings, 

optically isolated relays U30, U31 on Figure 7C and optically isolated relay U33 
on Figure 7B all operate in the period between the rings so that the FSK 
modulated caller ID information is connected to the CODEC and data pxmip DSP 
in Figures 8 A and 8B, as described more fiilly below. 

20 Referring now to Figure 7B, more of the telephone line filtering circuitry 

is shown. Some of the telephone line buffering circuitry, such as inductor LI 
and resistor Rl, are optional and are connected for various telephone line 
standards used aroimd the world to meet local requirements. For example, 
Switzerland requires a 22 millihenry inductor and IK resistor in series with the 

25 line. For all other countries, the IK resistor is replaced with a 0 ohm resistor. 

Relay U29, shown in Figure 7B, is used to accomplish pulse dialing by 
opening and shorting the tip and ring wires. Optical relay X2 is engaged during 
pulse dialing so that the tip and ring are shorted directly. Transistors Q2 and Q3, 
along with the associated discrete resistors, comprise a holding circuit to provide 

30 a current path or current loop on the telephone line to grab the line. 
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Figure 7 A shows the telephone interface connections between the 

. f tv,^ nresent system and the handset, headset, and 
hardware components of the present sysicm cu 

microphone. . _„ 

The co-mecSoos Tl and T2 for fl>e telephone line 6om Figure 7B ^ 

conne^ed ,o tt^fonner TRl shown in ,he elecmcal =chema«c diag^i of 
Figure 8B. Oniy *e AC components of .he si^ pass through Tansfomter 
tT Tleconnectionof signalsa^achedtothesecondaryofTRl isshownfor 
both transmitting and reaving informahon over the telephone Im. 

tocoming signaU are buffered by operation^ amplifiers U27A and U27B. 
The first stage of buffering using operational ampUfier U27B is used (or echo 
oppression so ma. the «tted information being placed on «,e^lephone 
Ji3 not fed ba* into me receive ^rtion of the present system T^e seco^ 

.tage of ate input buffering through operationa. ampUfler U27A ts configured for 
a moderate amount of gain before driving to signal into CODEC U35. 

CODEC chip U35 on Figure 8B, interftce chip U34 on Ftgure 8A, and 
digital signal processor (DSP) chip U37 on Fi^ 8A comprise a da« pump 
chip set manufactured and sold by AT&T Microelec«^nics. A detailed 
aescription of the op^on of fliese three chips in direct connection a^d 

cooperation with one another is described in the publication enti.^^ ATW 
V32bis/V.32/FAXHigh-SpeedDataPumpChipSe.DataBook."pubhs^cdby 

AT&T Microelectronics, December 1991, which is hereby incorporated by 
reference. This AT&T data pump chip set comprises the core of an integ^ted. 
two-wire full-duplex modem which is capable of operation over standard 
.elephonelinesorleasedlines. THe daU pump chip set conforms to to 
..lecommunications specifications in CCITT recommendations V-'^bis "2. 
V 22bis V.22. V.23, V.21, and is compatible with to Bell 212A and 103 
modeml Speeds of 14400, 9600, 4800, 2400. .200, 600, and 300 bits per 
.cond are supported. TWs dal. pump chip set consists of a ^^U-co,^ 
DSP16A digital signal processor U37, an mterftce chip U34, and an AT&T 
T7525 linear CODEC U35. The AT&T V.32 data pump chip set is available 
from AT&T Microelectronics. 
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The chip set U34, U35, and U37 on Figures 8A and 8B perform all A/D, 
D/A, modulation, demodulation, and echo cancellation of all signals placed on or 
taken from the telephone line. The CODEC U35 performs DTMF tone 
generation and detection, signal analysis of call progress tones, etc. The 
5 transmission of information on the telephone line from CODEC U35 is through 
buffer U28A, through CMOS switch U36, and through line buffer U25. The 
CMOS switch U36 is used to switch between the data pump chip set CODEC of 
circuit 310 (shown in Figure 3) and the voice control CODEC of circuit 306 
(also shown in Figure 3). The signal lines AOUTN and AOUTP correspond to 

10 signals received from the voice control CODEC of circuit 306. CODEC U35 is 
part of circuit 311 of Figure 3. 

The main controller of controller circuit 313 and the support circuits 312, 
314, 316, 317, and 308 are shown in Figures 5A-5C. In the preferred 
embodiment of the present system, the main controller is a Z80180 8-bit 

1 5 microprocessor chip. In the preferred implementation, microcontroller chip Ul 7 
is a Z80180 microprocessor, part number Z84C01 by Zilog, Inc. of Campbell, 
California (also available from Hitachi Semiconductor as part number 
HD64180Z). The Zilog Z80180 8-bit microprocessor operates at 12 MHz 
intemal clock speed by means of an extemal crystal XTAL, which, in the 

20 preferred embodiment, is a 24.576 MHz crystal. The crystal circuit includes 

capacitors C4 and C5, which are 20 pf capacitors, and resistor R28, which is a 33 
ohm resistor. The crystal and support circuitry is connected according to 
manufacturer's specifications found in the Zilog Intelligent Peripheral 
Controllers Data Book published by Zilog, Inc. The product description for the 

25 Z84C01 Z80180 CPU from the Z84C01 Z80 CPU Product Specification pgs. 43- 
73 of the Zilog 1991 Intelligent Peripheral Controllers databook is hereby 
incorporated by reference. 

The Z80180 microprocessor in microcontroller chip U17 is intimately 
connected to a serial/parallel I/O counter timer chip U15 which is, in the 

30 preferred embodiment, a Zilog 84C90 CMOS Z80 KIO 

serial/parallel/counter/timer integrated circuit available from Zilog, Inc. This 
multi-function I/O chip U15 combines the fiuictions of a parallel input/output 
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port, a serial input/output port, a bus control circuitry, and a clock timer circuit in 
one chip. The Zilog Z84C90 product specification describes the detailed internal 
operations of this circuit in the Zilog Intelligent Peripheral Controllers 1991 
Handbook available from Zilog, Inc. Z84C90 CMOS Z80KIO Product 
5 Specification pgs. 205-224 of the Zilog 1991 Intelligent Peripheral Controllers 
Data Book is hereby incorporated by reference. 

Data and address buses A and B shown in Figure 5 A connect the Z80180 
microprocessor in microcontroller U17 with the Z80 KIO circxiit U15 and a gate 
array circuit U19, and to other portions of the electrical schematic diagrams. The 

10 gate array U19 includes miscellaneous latch and buffer circuits for the present 
system which normally would be found in discrete SSI or MSI integrated 
circuits. By combining a wide variety of miscellaneous support circuits into a 
single gate array, a much reduced design complexity and manufacturing cost is 
achieved. A detailed description of the intemal operations of gate array U19 is 

1 5 described more fiilly below in conjvmction with schematic diagrams of Figures 
1 OA- IOC. 

The memory chips which operate in conjunction with the Z80 
microprocessor in microcontroller chip U17 are shown in Figure 5C. The 
connections A, B correspond to the connections to the address and data buses, 

20 respectively, found on Figure 5 A. Memory chips U16 and U13 are read-only 
memory (ROM) chips which are electrically alterable in place. These 
programmable ROMs, typically referred to as flash PROMs or Programmable 
Erasable Read-Only Memories (PEROMs), hold the program code and operating 
parameters for the present system in a non- volatile memory. Upon power-up, the 

25 programs and operating parameters are transferred to the voice control DSP 
RAM U12, shown in Figure 9B. 

In the preferred embodiment, RAM chip U 14 is a pseudostatic RAM 
which is essentially a dynamic RAM with a built-in refresh. Those skilled in the 
art will readily recognize that a wide variety of memory chips may be used and 

30 substituted for pseudo-static RAM U14 and flash PROMs U16 and U13. 

Referring once again to Figure 3, the main controller circuit 313 
communicates with the voice control DSP of circuit 306 through dual port RAM 
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circuit 308. The digital telephone CODEC circuit 305, the voice control DSP 
and CODEC circuit 306, the DSP RAM 307, and the dual port RAM 308 are all 
shown in detailed electrical schematic diagrams of Figures 9A-9C. 

Referring to Figure 9A, the DSP RAM chips U6 and U7 are shown with 
5 associated support chips. Support chips Ul and U2 are in the preferred 

embodiment part 74HCT244 which are TTL-level latches used to capture data 
from the data bxis and hold it for the DSP RAM chips U6 and U7. Circviits U3 
and U4 are also latch circuits for also latching address information to control 
DSP RAM chips U6 and U7. Once again, the address bus A and data bus B 
10 shown in Figure 9 A are multi-wire connections which, for the clarity of the 
drawing, are shown as a thick bus wire representing a grouping of individual 
wires. 

Also in Figure 9A, the DSP RAMs U6 and U7 are connected to the voice 
control DSP and CODEC chip U8 as shown split between Figures 9A and 9B. 

15 DSP/CODEC chip U8 is, in the preferred embodiment, part number fFE® 

DSP16C, digital signal processor and CODEC chip manufactured and sold by 
AT&T Microelectronics. This is a 16-bit programmable DSP with a voice band 
sigma-delta CODEC on one chip. Although the CODEC portion of this chip is 
capable of analog-to-digital and digital-to-analog signal acquisition and 

20 conversion system, the actual D/A and AID functions for the telephone interface 
occur in digital telephone CODEC chip U12 (corresponding to digital telephone 
CODEC circuit 305 of Figure 3). Chip U8 includes circuitry for sampling, data 
conversion, anti-aliasing filtering, and anti-imaging filtering. The progranmiable 
control of DSP/CODEC chip U8 allows it to receive digitized voice from the 

25 telephone interface (through digital telephone CODEC chip U12) and store it in 
a digitized form in the dual port RAM chip Ul 1 . The digitized voice can then be 
passed to the main controller circuit 313, where the digitized voice may be 
transmitted to the personal computer over the RS232 circuit 315. Li a similar 
fashion, digitized voice stored by the main controller circuit 3 13 in the dual port 

30 RAM Ul 1 may be transferred through voice control DSP chip U8, converted to 
analog signals by telephone CODEC U12, and passed to the user. Digital 
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telephone CODEC chip U12 includes a direct telephone handset interface on the 
chip. 

The connections to DSP/CODEC chip U8 are shown split across Figures 
9A and 9B. Address/data decode chips U9 and UIO on Figure 9A serve to 
5 decode address and data information from the combined address/data bus for the 
dual port RAM chip Ul 1 of Figure 9B. The interconnection of the DSP/CODEC 
chip U8 shown on Figures 9A and 9B is described more fully in the WE^ 
DSP16C Digital Signal Processor/CODEC Data Sheet published May, 1991, by 
AT&T Microelectronics, which is hereby incorporated by reference. 

10 The Digital Telephone CODEC chip U12 is also shown in Figure 9B 

which, in the preferred embodiment, is part number T7540 Digital Telephone 
CODEC manufactured and sold by AT&T Microelectronics. A more detailed 
description of this telephone CODEC chip U12 is described in the T7540 Digital 
Telephone CODEC Data Sheet and Addendum published July, 1991, by AT&T 

15 Microelectronics, which is hereby incorporated by reference. 

Support circuits shown on Figure 9C are used to facilitate 
communication between CODEC chip U12, DSP/CODEC chip U8, and dual 
port RAM Ul 1 . For example, an 8 kHz clock is used to synchronize the 
operation of CODEC U12 and DSP/CODEC U8. 

20 The operation of the dual port RAM Ul 1 is controlled both by DSP U8 

and main controller chip U17. The dual port operation allows writing into one 
address while reading from another address in the same chip. Both processors 
can access the exact same memory locations with the use of a contention 
protocol such that when one is reading, the other cannot be writing. In the 

25 preferred embodiment, dual port RAM chip Ul 1 is part number CYZC13 1 

available from Cyprus Semiconductor. This chip includes built-in contention 
control so that if two processors try to access the same memory location at the 
same time, the first one making the request gets control of the address location 
and the other processor must wait. In the preferred embodiment, a circular 

30 buffer is arranged in dual port RAM chip Ul 1 comprising 24 bytes. By using a 
circular buffer configuration with pointers into the buffer area, both processors 
will not have a contention problem. 
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The DSP RAM chips U6 and U7 are connected to the DSP chip U8 and 
also connected through the data and address buses to the Zilog microcontroller 
U17. In this configuration, the main controller can download the control 
programs for DSP U8 into DSP RAMs U6 and U7. In this fashion, DSP control 
5 can be changed by the main controller or the operating programs on the personal 
computer, described more fully below. The control programs stored in DSP 
chips U6 and U7 originate in the flash PEROM chips U16 and U17. The power- 
up control routine operating on controller chip U17 downloads the DSP control 
routines into DSP RAM chips U6 and U7. 

1 0 The interface between the main controller circuit 313 and the personal 

computer is through SIO circuit 314 and RS232 serial interface 315. These 
interfaces are described more fully in conjunction with the detailed electrical 
schematic diagrams of Figures 6A-6C. RS232 connection Jl is shown on Figure 
6A with the associated control circuit and interface circuitry used to generate and 

15 receive the appropriate RS232 standard signals for a serial communications 

interface with a personal computer. Figure 6B is a detailed electrical schematic 
diagram showing the generation of various voltages for powering the hardware 
components of the electrical schematic diagrams of hardware components 20. 
The power for the present hardware components is received on connector J5 and 

20 controlled by power switch S34. From this circmtry of Figure 6B, plus and 

minus 12 volts, plus five volts and minus five volts are derived for operating the 
various RAM chips, controller chips, and support circuitry of the present system. 
Figure 6C shows the interconnection of the status LEDs found on the front 
display of the box 20. 

25 Finally, the "glue logic" used to support various functions in the hardware 

components 20 are described in conjunction with the detailed electrical 
schematic diagrams of Figures 1 OA- IOC. The coimections between Figures lOA 
and IOC and the previous schematic diagrams is made via the labels for each of 
the lines. For example, the LED status lights are controlled and held active by 

30 direct addressing and data control of latches GAl and GA2. For a more detailed 
description of the connection of the glue logic of Figures 1 OA- IOC, the gate 
array U19 is shown connected in Figures 5 A and 5B. 
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Packet Protocol Between the PC and the Hardware Component 

A special packet protocol is used for communication between the 
hardware components 20 and the personal computer (PC) 10. The protocol is 
used for transferring different types of information between the two devices such 
as the transfer of DATA, VOICE, and QUALIFIED information. The protocol 
also uses the BREAK as defined in CCITT X.28 as a means to maintain protocol 
synchronization. A description of this BREAK sequence is also described in the 
Statutory Invention Registration entitled, "Escape Methods for Modem 
Communications," to Timothy D. Gunn, filed January 8, 1993, which is hereby 
incorporated by reference. 

The protocol has two modes of operation. One mode is packet mode, and 
the other is stream mode. The protocol allows mixing of different types of 
information into the data stream without having to physically switch modes of 
operation. The hardware component 20 will identify the packet received from 
the computer 10 and perform the appropriate action according to the 
specifications of the protocol. If it is a data packet, then the controller 3 13 of 
hardware component 20 would send it to the data pump circuit 311. If the packet 
is a voice packet, then the controller 313 of hardware component 20 would 
distribute that information to the Voice DSP 306. This packet transfer 
mechanism also works in the reverse, where the controller 3 1 3 of hardware 
component 20 would give different information to the computer 10 without 
having to switch into different modes. The packet protocol also allows 
commands to be sent to either the main controller 313 directly or to the Voice 
DSP 306 for controlling different options without having to enter a command 
state. 

Packet mode is made up of 8-bit asynchronous data and is identified by a 
begiiming synchronization character (01 hex) followed by an ID/LI character and 
then followed by the information to be sent. In addition to the ID/LI character 
codes defined below, those skilled in the art will readily recognize that other 
ID/LI character codes could be defined to allow for additional types of packets, 
such as video data, or alternate voice compression algorithm packets, such as 
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Codebook Excited Linear Predictive Coding (CELP) algorithm, GSM, RPE, 
VSELP, etc. 

Stream mode is used when large amounts of one type of packet (V OICE, 
DATA, or QUALIFIED) is being sent. The transmitter tells the receiver to enter 
5 stream mode by a unique command. Thereafter, the transmitter tells the receiver 
to terminate stream mode by using the BREAK command followed by an "AT" 
type command. The command used to terminate the stream mode can be a 
command to enter another type of stream mode or it can be a command to enter 
back into packet mode. 
10 Currently, there are three types of packets used: DATA, VOICE, and 

QUALIFIED. Table 1 shows the common packet parameters used for all three 
packet types. Table 2 shows the three basic types of packets with the sub-types 
listed. 

15 

TABLE 1: Packet Parameters 

20 1 . Asynchronous transfer 

2. 8 bits, no parity 

3. Maximum packet length of 128 bytes 

— IDentifier byte = 1 

— InFormation =127 
25 4. SPEED 

— variable from 9600 to 57600 

— default to 19200 

30 
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TABLE 2 Packet Types 



1. 
2. 
3. 



Data 

Voice 

Qualified: 

a. 

b. 

c. 

d. 

e. 

f. 

g- 
h. 



COMMAND 

RESPONSE 

STATUS 

FLOW CONTROL 

BREAK 

ACK 

NAK 

STREAM 
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A Data Packet is shown in Table 1 and is used for normal data transfer 
between the controller 313 of hardware comp>onent 20 and the computer 10 for 
such things as text, file transfers, binary data, and any other type of information 
presently being sent through modems. All packet transfers begin with a synch 
character 01 hex (synchronization byte). The Data Packet begins with an ID byte 
which specifies the packet type and packet length. Table 3 describes the Data 
Packet byte structure, and Table 4 describes the bit structure of the ID byte of the 
Data Packet. Table 5 is an example of a Data Packet with a byte length of 6. 
The value of the LI field is the actual length of the data field to follow, not 
counting the ID byte. 
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TABLE 3: Data Packet Byte Structure 



byte 1 
byte 2 

bytes 3-127 = 



01 h (sync byte) 

ID/LI (ID byte/length indicator) 
data (depending on LI) 





01 


ID 










40 


SYNC 


LI 


data 


data 


data 


data 



data 
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TABLE 4: ID Byte of Data Packet 

Bit 7 identifies the type of packet 

Bits 6-0 contain the LI or length indicator portion of the 
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TABLE 5: Data Packet Example 

LI (length indicator) = 6 



01 


06 














SYNC 


ID 


data 


data 


data 


data 


data 


data 
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The Voice Packet is used to transfer compressed VOICE messages 
between the controller 3 1 3 of hardware component 20 and the computer 10. The 
Voice Packet is similar to the Data Packet except for its length, which is, in the 
preferred embodiment, currently fixed at 23 bytes of data. Once again, all 
packets begin with a synchronization character chosen in the preferred 
embodiment to be 01 hex (OIH). The ED byte of the Voice Packet is completely 
a zero byte: all bits are set to zero. Table 6 shows the ED byte of the Voice 
Packet and Table 7 shows the Voice Packet byte structure. 
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TABLE 6: ID Byte of Voice Packet 

5 4 3 2 
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TABLE 7: Voice Packet Byte Structure 

LI (length indicator) = 0 
23 bytes of data 



20 


01 


00 












SYNC 


ID 


data 


data 


data 


data 



data 



25 The Qualified Packet is ixsed to transfer commands and other non- 

data/voice-related information between the controller 313 of hardware 
component 20 and the computer 10. The various species or types of the 
Qualified Packets are described below and are listed above in Table 2. Once 
again, all packets start with a synchronization character chosen in the preferred 

30 embodiment to be 01 hex (OIH). A Qualified Packet starts with two bytes, 

where the first byte is the ED byte and the second byte is the QUALIFIER type 
identifier. Table 8 shows the ID byte for the Qualified Packet, Table 9 shows the 
byte structure of the Qualified Packet, and Tables 10-12 list the Qualifier Type 
byte bit maps for the three types of Qualified Packets. 

35 

TABLES: ID Byte of Qualified Packet 

40 7 6 5 4 3 2 
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The Length Identifier of the ID byte equals the amount of data which 
follows, including the QUALIFIER byte (QUAL byte + DATA). If LI = 1, then 
the Qualifier Packet contains the Q byte only. 



TABLE 9: Qualifier Packet Byte Structure 





01 


85 


QUAL 










10 


SYNC 


ID 


BYTE 


data 


data 


data 





data 



The bit maps of the Qualifier Byte (QUAL BYTE) of the Qualified 
15 Packet are shown in Tables 10-12. The bit map follows the pattem, whereby if 
the QUAL byte = 0, then the conmiand is a break. Also, bit 1 of the QUAL byte 
designates ack/nak, bit 2 designates flow control, and bit 6 designates stream 
mode command. Table 10 describes the Qualifier Byte of Qualified Packet, 
Group 1 , which are immediate commands. Table 1 1 describes the Qualifier Byte 
20 of Qualified Packet, Group 2, which are stream mode commands in that the 
command is to stay in the designated mode until a BREAK + INIT command 
string is sent. Table 12 describes the Qualifier Byte of Qualified Packet, Group 
3, which are information of status commands. 



30 TABLE 10: Qualifier Byte of Qualified Packet: Group 1 
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6 


5 


4 


3 


2 


1 


0 




X 


X 


X 


X 


X 


X 


X 


X 




0 


0 


0 


0 


0 


0 


0 


0 


= break 


0 


0 


0 


0 


0 


0 


1 


0 


= ACK 


0 


0 


0 


0 


0 


0 


1 


1 


= NAK 


0 


0 


0 


0 


0 


1 


0 


0 


= xofF or stop sending data 


0 


0 


0 


0 


0 


1 


0 


1 


= xon or resume sending 


0 


0 


0 


0 


1 


0 


0 


0 


= cancel fax 
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TABLE 11: Qualifier Byte of Qualified Packet: Group 2 
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7 6 
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4 


3 


2 


1 


0 






X X 


X 


X 


X 


X 


X 


X 






0 1 


0 


0 


0 


0 


0 
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= stream command mode 




0 1 


0 


0 


0 


0 


1 


0 


= stream data 


10 


0 1 


0 


0 


0 


0 


1 


1 


= stream voice 




0 1 


0 


0 


0 


1 


0 


0 


= stream video 




0 1 


0 


0 


0 


1 


0 


1 


= stream A 




0 1 


0 


0 


0 


1 


1 


0 


= stream B 




0 1 


0 


0 


0 


1 


1 


1 


= stream C 
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The Qualifier Packet indicating stream mode and BREAK attention is 
used when a large amount of information is sent (voice, data, ...) to allow the 
20 highest throughput possible. This command is mainly intended for use in DATA 
mode but can be used in any one of the possible modes. To change firom one 
mode to another, break-init sequence would be given. A break "AT...<cr>" type 
command would cause a change in state and set the serial rate fi"om the "AT" 
command. 
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TABLE 12: Qualifier Byte of Qualified Packet: Group 3 
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0 
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X 
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0 


0 


0 


0 


0 


0 


0 


= commands 
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0 


0 


0 


0 


0 


0 


1 


= responses 


40 


1 


0 


0 


0 


0 


0 


1 


0 


= status 



Cellular Supervisory Packet 

45 In order to determine the status of the cellular link, a supervisor packet 

(shown in Table 13) is used. Both sides of the cellular link will send the cellular 
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supervisory packet every 3 seconds. Upon receiving the cellular supervisory 
packet, the receiving side will acknowledge it using the ACK field of the cellular 
supervisory packet. If the sender does not receive an acknowledgement within 
one second, it will repeat sending the cellular supervisory packet up to 12 times. 
5 After 12 attempts of sending the cellular supervisory packet without an 

acknowledgement, the sender will disconnect the line. Upon receiving an 
acknowledgment, the sender will restart its 3 second timer. Those skilled in the 
art vdll readily recognize that the timer values and wait times selected here may 
be varied v^thout departing from the spirit or scope of the present invention. 
10 

TABLE 13: Cellular Supervisory Packet Byte Structure 



8F 


ID 


LI 


ACK 


data 


data 



Speech Compression 

20 The Speech Compression algorithm described above for use in the voice 

mail function, the multi-media mail function, and the show-and-tell function of 
the present system is all accomplished via the voice control circuit 306. 
Referring once again to Figure 3, the user is talking either through the handset, 
the headset, or the microphone/speaker telephone interface. The analog voice 

25 signals are received and digitized by the telephone CODEC circuit 305. The 

digitized voice information is passed from the digital telephone CODEC circuit 
305 to the voice control circuits 306. The digital signal processor (DSP) of the 
voice control circuit 306 is progranmied to do the voice compression algorithm. 
The source code programmed into the voice control DSP is attached in the 

30 microfiche appendix. The DSP of the voice control circuit 306 compresses the 
speech and places the compressed digital representations of the speech into 
special packets described more fully below. As a result of the voice compression 
algorithm, the compressed voice information is passed to the dual port RAM 
circuit 308 for either forwarding and storage on the disk of the personal 

35 computer via the RS232 serial interface or for multiplexing v^th conventional 
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modem data to be transmitted over the telephone Ime via the telephone line 
interface circuit 309 in the voice over data mode of operation show-and-tell 
function 123. 

5 Speech Compression Algorithm 

To multiplex high-fidelity speech with digital data and transmit both over 
the over the telephone line, a high available bandwidth would normally be 
required. In the present invention, the analog voice information is digitized into 
8-bit PCM data at an 8 kHz sampling rate producing a serial bit stream of 64,000 

10 bps serial data rate. This rate cannot be transmitted over the telephone line. 
With the Speech Compression algorithm described below, the 64 kbs digital 
voice data is compressed into a 9200 bps encoding bit stream using a fixed-point 
(non-floating point) DSP such that the compressed speech can be transmitted 
over the telephone line using a 9600 baud modem transmission. This is an 

15 approximately 7:1 compression ratio. This is accomplished in an efficient 
manner such that enough machine cycles remain during real-time speech 
compression to allow real-time acoustic and line echo cancellation in the same 
fixed-point DSP. 

Even at 9200 bps serial data rate for voice data transmission, this bit rate 
20 leaves little room for concurrent conventional data transmission. A silence 
detection function is used to detect quiet intervals in the speech signal and 
substitute conventional data packets in lieu of voice data packets to effectively 
time multiplex the voice and data transmission. The allocation of time for 
conventional data transmission is constantly changing depending upon how 
25 much silence is on the voice channel. 

The voice compression algorithm of the present system relies on a model 
of human speech which shows that human speech contains redimdancy inherent 
in the voice pattems. Only the incremental innovations (changes) need to be 
transmitted. The algorithm operates on 160 digitized speech samples (20 
30 milliseconds), divides the speech samples into time segments of 5 milliseconds 
each, and uses predictive coding on each segment. With this algorithm, the 
current segment is predicted as best as possible, based on the past recreated 
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segments, and a diflference signal is determined. The difference value is 
compared to the stored difference values in a look-up table or code book, and the 
address of the closest value is sent to the remote site along with the predicted 
gain and pitch values for each segment. In this fashion, four 5ms speech 
5 segments can be reduced to a packet of 23 bytes or 1 84 bits (46 bits per sample 
segment). By transmitting 184 bits every 20 milliseconds, an effective serial 
data transmission rate of 9200 bps is accomplished. 

To produce this compression, the present system includes a xmique 
Vector Quantization (VQ) speech compression algorithm designed to provide 

1 0 maximum fidelity with minimum compute power and bandwidth. The VQ 
algorithm has two major components. The first section reduces the dynamic 
range of the input speech signal by removing short-term and long-term 
redxmdancies. This reduction is done in the waveform domain, with the 
synthesized part used as the reference for determining the incremental "new" 

15 content. The second section maps the residual signal into a code book optimized 
for preserving the general spectral shape of the speech signal. 

Figure 1 1 is a high level signal flow block diagram of the speech 
compression algorithm used in the present system to compress the digitized 
voice for transmission over the telephone line in the voice over data mode of 

20 operation or for storage and use on the personal computer. The transmitter and 
receiver components are implemented using the programmable voice control 
DSP/CODEC circuit 306 shown m Figure 3. 

The DC removal stage 1101 receives the digitized speech signal and 
removes the DC bias by calculating the long-term average and subtracting it 

25 from each sample. This ensures that the digital samples of the speech are 

centered about a zero mean value. The pre-emphasis stage 1 103 whitens the 
spectral content of the speech signal by balancing the extra energy in the low 
band with the reduced energy in the high band. 

The system finds the innovation in the current speech segment by 

30 subtracting 1 109 the prediction from reconstructed past samples synthesized 

from synthesis stage 1 107. This process requires the synthesis of the past speech 
samples locally (analysis by synthesis). The synthesis block 1 107 at the 
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transmitter performs the same function as the synthesis block 1 1 13 at the 
receiver. When the reconstructed previous segment of speech is subtracted from 
the present segment (before prediction), a difference term is produced in the 
form of an error signal. This residual error is used to find the best match in the 
5 code book 1 105. The code book 1 105 quantizes the error signal using a code 
book generated from a representative set of speakers and environments. A 
minimum mean squared error match is determined in 5ms segments. In addition, 
the code book is designed to provide a quantization error with spectral roll-off 
(higher quantization error for low frequencies and lower quantization error for 

10 higher frequencies). Thus, the quantization noise spectrum in the reconstructed 
signal v^ll always tend to be smaller than the underlying speech signal. 

The channel corresponds to the telephone line in which the compressed 
speech bits are multiplexed with data bits using a packet format described below. 
The voice bits are sent in 100ms packets of 5 frames each, each frame 

1 5 corresponding to 20ms of speech in 160 samples. Each frame of 20ms is fiirther 
divided into 4 sub-blocks or segments of 5ms each. In each sub-block of the 
data consists 7 bits for the long-term predictor, 3 bits for the long-term predictor 
gain, 4 bits for the sub-block gain, and 32 bits for each code book entry, for a 
total 46 bits each 5ms. The 32 bits for code book entries consists of four 8-bit 

20 table entries in a 256 long code book of 1 .25ms duration. In the code book 

block, each 1 .25ms of speech is looked up in a 256 word code book for the best 
match. The 8-bit table entry is transmitted rather than the actual samples. The 
code book entries are pre-computed from representative speech segments. (See 
the DSP Source Code in the microfiche appendix.) 

25 On the receiving end 1200, the synthesis block 1 1 13 at the receiver 

performs the same fimction as the synthesis block 1 107 at the transmitter. The 
synthesis block 1113 reconstructs the original signal from the voice data packets 
by using the gain and pitch values and code book address corresponding to the 
error signal most closely matched in the code book. The code book at the 

30 receiver is similar to the code book 1 105 in the transmitter. Thus, the synthesis 
block recreates the original pre-emphasized signal. The de-emphasis stage 1115 
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inverts the pre-emphasis operation by restoring the balance of original speech 
signal. 

The complete speech compression algorithm is summarized as follows: 

a. Remove any DC bias in the speech signal. 

5 

b. Pre-emphasize the signal. 

c. Find the iimovation in the current speech segment by 
subtracting the prediction from reconstructed past 

10 samples. This step requires the synthesis of the past 

speech samples locally (analysis by synthesis) such that 
the residual error is fed back into the system. 

d. Quantize the error signal using a code book generated 

1 5 from a representative set of speakers and envirormients. A 

minimum mean squared error match is determined in 5ms 
segments. In addition, the code book is designed to 
provide a quantization error with spectral roll-off (higher 
quantization error for lower frequencies and lower 

20 quantization error for higher frequencies). Thus, the 

quantization noise spectrum in the reconstructed signal 
will always tend to be smaller than the imderlying speech 
signal. 

25 e. At the transmitter and the receiver, reconstruct the speech 

from the quantized error signal fed into the inverse of the 
function in step c above. Use this signal for analysis by 
synthesis and for the output to the reconstruction stage 
below. 
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f. Use a de-emphasis filter to reconstruct the output. 



The major advantages of this approach over other low bit rate algorithms 
are that there is no need for any complicated calculation of reflection coeflRcients 

35 (no matrix inverse or lattice filter computations). Also, the quantization noise in 
the output speech is hidden under the speech signal and there are no pitch 
tracking artifacts: the speech sounds "natural," with only minor increases of 
background hiss at lower bit rates. The computational load is reduced 
significantly compared to a VSELP algorithm, and variations of the same 

40 algorithm provide bit rates of 8, 9.2, and 16 Kbits/s. The total delay through the 
analysis section is less than 20 milliseconds in the preferred embodiment. The 
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present algorithm is accomplished completely in the waveform domain and there 
is no spectral information being computed and there is no filter computations 
needed. 

5 Detailed Descriprion of the Speech Compression Algorithm 

The speech compression algorithm is described in greater detail with 
reference to Figures 1 1 through 13, and with reference to the block diagram of 
the hardware components of the present system shown at Figure 3. Also, 
reference is made to the detailed schematic diagrams in Figures 9A-9C. The 

10 voice compression algorithm operates within the programmed control of the 

voice control DSP circuit 306. In operation, the speech or analog voice signal is 
received through the telephone interface 301, 302, or 303 and is digitized by the 
digital telephone CODEC circuit 305. The CODEC for circuit 305 is a 
companding |i-law CODEC. The analog voice signal fi-om the telephone 

15 interface is band-limited to about 3,500 Hz and sampled at 8 kHz by digital 

telephone CODEC 305. Each sample is encoded into 8-bit PCM data producing 
a serial 64kb/s signal. The digitized samples are passed to the voice control 
DSP/CODEC of circuit 306. There, the 8-bit jx-law PCM data is converted to 
13 -bit linear PCM data. The 13 -bit representation is necessary to accurately 

20 represent the linear version of the logarithmic 8-bit |i-law PCM data. With 
linear PCM data, simpler mathematics may be performed on the PCM data. 

The voice control DSP/CODEC of circuit 306 correspond to the single 
integrated circuit U8 shown in Figures 9A and 9B as a W£^ DSP16C Digital 
Signal Processor/CODEC fi-om AT&T Microelectronics, which is a combined 

25 digital signal processor and a linear CODEC in a single chip, as described above. 
The digital telephone CODEC of circuit 305 corresponds to integrated circuit 
U12, shown in Figure 9B as a T7540 companding |i-law CODEC. 

The sampled and digitized PCM voice signals from the telephone (i-law 
CODEC U12 shown in Figure 9B are passed to the voice control DSP/CODEC 

30 U8 via direct data lines clocked and synchronized to an 8 kHz clocking 

frequency. The digital samples are loaded into the voice control DSP/CODEC 
U8 one at a time through the serial input and stored into an internal queue held in 
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RAM and converted to linear PCM data. As the samples are loaded into the end 
of the queue in the RAM of the voice control DSP U8, the samples at the head of 
the queue are operated upon by the voice compression algorithm. The voice 
compression algorithm then produces a greatly compressed representation of the 
5 speech signals in a digital packet form. The compressed speech signal packets 
are then passed to the dual port RAM circuit 308 shown in Figure 3 for use by 
the main controller circuit 3 1 3 for either transferring in the voice over data mode 
of operation or for transfer to the personal computer for storage as compressed 
voice for functions, such as telephone answering machine message data, for use 

10 in the multi-media documents, and the like. 

In the voice over data mode of operation, voice control DSP/CODEC 
circuit 306 of Figure 3 will be receiving digital voice PCM data from the digital 
telephone CODEC circuit 305, compressing it, and transferring it to dual port 
RAM circuit 308 for multiplexing and transfer over the telephone line. This is 

1 5 the transmit mode of operation of the voice control DSP/CODEC circuit 306 
corresponding to transmitter block 1 100 of Figure 1 1 and corresponding to the 
compression algorithm of Figure 12. 

Concurrent with this transmit operation, the voice control DSP/CODEC 
circuit 306 is receiving compressed voice data packets from dual port RAM 

20 circviit 308, uncompressing the voice data, and transferring the xmcompressed 
and reconstructed digital PCM voice data to the digital telephone CODEC 305 
for digital-to-analog conversion and eventual transfer to the user through the 
telephone interface 301, 302, or 304. This is the receive mode of operation of 
the voice control DSP/CODEC circuit 306 corresponding to receiver block 1200 

25 of Figure 1 1 and corresponding to the decompression algorithm of Figure 13. 

Thus, the voice control DSP/CODEC circuit 306 is processing the voice data in 
both directions in a ftill-duplex fashion. 

The voice control DSP/CODEC circuit 306 operates at a clock frequency 
of approximately 24.576 Mhz, while processing data at sampling rates of 

30 approximately 8 kHz in both directions. The voice compression/decompression 
algorithms and packetization of the voice data is accomplished in a quick and 
efficient fashion to ensure that all processing is done in real-time without loss of 
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voice information. This is accomplished in an efficient manner such that enough 
machine cycles remain in the voice control DSP circuit 306 during real-time 
speech compression to allow real-time acoustic and line echo cancellation in the 
same fixed point DSP. 
5 In programmed operation, the availability of an 8-bit sample of PCM 

voice data firom the |j.-law digital telephone CODEC circuit 305 causes an 
interrupt in the voice control DSP/CODEC circuit 306 where the sample is 
loaded into internal registers for processing. Once loaded into an internal 
register, it is transferred to a RAM address, which holds a queue of samples. 

10 The queued PCM digital voice samples are converted from 8-bit |x-law data to a 
13 -bit linear data format using table look-up for the conversion. Those skilled in 
the art will readily recognize that the digital telephone CODEC circuit 305 could 
also be a linear CODEC. 

Referring to Figure 11, the digital samples are shown as speech entering 

15 the transmitter block 1 100. The transmitter block, of course, is the mode of 

operation of the voice control DSP/CODEC circuit 306 operating to receive local 
digitized voice information, compress it, and packetize it for transfer to the main 
controller circuit 3 1 3 for transmission on the telephone line. The telephone line 
connected to telephone line interface 309 of Figure 3 corresponds to the channel 

20 nil of Figure 11. 

A frame rate for the voice compression algorithm is 20 milliseconds of 
speech for each compression. This correlates to 160 samples to process per 
frame. When 160 samples are accumulated in the queue of the internal DSP 
RAM, the compression of that sample frame is begun. 

25 The voice control DSP/CODEC circuit 306 is programmed to first 

remove the DC component 1101 of the incoming speech. The DC removal is an 
adaptive function to establish a center base line on the voice signal by digitally 
adjusting the values of the PCM data. The formula for removal of the DC bias 
or drift is as follows: 

32735 

S(n) = x{n) - x(w - 1) + a * Sin - 1), where a = ^^^^^ 
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The removal of the DC is for the 20 millisecond frame of voice, w^hich 
amoimts to 160 samples. The selection of a is based on empirical observation to 
provide the best result. 

Referring to Figure 12, the voice compression algorithm in a control flow 
5 diagram is shown, which will assist in the understanding of the block diagram of 
Figure 11. The analysis and compression begin at block 1201, where the 13 -bit 
linear PCM speech samples are accumulated until 160 samples representing 20 
milliseconds of voice or one frame of force is passed to the DC removal portion 
of code operating v^thin the programmed voice control DSP/CODEC circuit 
10 306. The DC removal portion of the code described above approximates the 
base line of the frame of voice by using an adaptive DC removal technique. 

A silence detection algorithm 1205 is also included in the programmed 
code of the DSP/CODEC 306. The silence detection function is a summation of 
the square of each sample of the voice signal over the frame. If the power of the 
15 voice frame falls below a pre-selected threshold, this would indicate a silent 
frame. The detection of a silence frame of speech is important for later 
multiplexing of the V-data and C-data described below. During silent portions 
of the speech, the main controller circuit 313 will transfer conventional digital 
data (C-data) over the telephone line in lieu of voice data (V-data). The formula 
20 for computing the power is: 

PWR= zlsinhSin) 

If the power PWR is lower than a pre-selected threshold, then the present 
25 voice frame is flagged as containing silence (see Table 15). The 160-sample 

silent frame is still processed by the voice compression algorithm; however, the 
silent frame packets are discarded by the main controller circuit 313 so that 
digital data may be transferred in lieu of voice data. 

The rest of the voice compression is operated upon in segments where 
30 there are four segments per frame amounting to 40 samples of data per segment. 
It is only the DC removal and silence detection which is accomplished over an 
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entire 20 millisecond frame. The pre-emphasis 1207 of the voice compression 
algorithm, shown in Figure 12, is the next step. The formula for the pre- 
emphasis is: 

Sin) = Sin)- r*Sin-lX where r=0.55 

Each segment thus amounts to five milliseconds of voice, which is equal 
to 40 samples. Pre-emphasis then is done on each segment. The selection of ris 
based on empirical observation to provide the best result. 

The pre-emphasis essentially flattens the signal by reducing the dynamic 
range of the signal. By using pre-emphasis to flatten the dynamic range of the 
signal, less of a signal range is required for compression, making the 
compression algorithm operate more efficiently. 

The next step in the speech compression algorithm is the long-term 
predictor (LTP). The long-term prediction is a method to detect the innovation 
in the voice signal. Since the voice signal contains many redundant voice 
segments, we can detect these redundancies and only send information about the 
changes in the signal from one segment to the next. This is accomplished by 
comparing the linear PCM data of the current segment on a sample by sample 
basis to the reconstructed linear PCM data from the previous segments to obtain 
the innovation information and an indicator of the error in the prediction. 

The first step in the long-term prediction is to predict the pitch of the 
voice segment, and the second step is to predict the gain of the pitch. For each 
segment of 40 samples, a long-term correlation lag PITCH and associated LTP 
gain factor fij (where 7-O, 1,2,3 corresponding to each of the four segments of 
the frame) are determined at 1209 and 1211, respectively. The computations are 
done as follows. 

From MINIMUM PITCH (40) to MAXIMUM PITCH (120) for indices 
40 through 120 (the pitch values for the range of previous speech viewed), the 
voice control DSP circuit 306 computes the cross-correlation between the current 
speech segment and the previous speech segment by comparing the samples of 
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the current speech segment against the reconstructed speech samples of the 
previous speech segment using the following formula: 



1=0 



where 7 = 40, ... 120 

S = current sample of current segment 
S past sample of reconstructed previous 



segment 



rtj^ = 0, 40, 80, 120 (the subframe index) 

10 and where the best fit is: 

Sxy = MAX{Sxy{j)}, where J = 40,... 120. 

The value of j for which the peak occurs is the PITCH. This is a 7-bit 
value for the current segment calculated at 1209. The value of j is an indicator of 
the delay or lag at which the cross-correlation matches the best between the past 

15 reconstructed segment and the current segment. This indicates the pitch of the 
voice in the current frame. The maximum computed value of j is used to reduce 
the redundancy of the new segment compared to the previous reconstructed 
segments in the present algorithm, since the value of j is a measure of how close 
the current segment is to the previous reconstructed segments. 

20 Next, the voice control DSP circuit 306 computes the LTP gain factor j3 

at 1211 using the following formula, in which Sxy is the current segment and So: 
is the previous reconstructed segment: 

Sxy(j) 
Sxx(j) 



segment 



where Sxx = ^S'^{i + MAX _ PITCH -best _ pitch) 

1=0 

25 The value of the LTP gain factor /? is a normalized quantity between zero 

and unity for this segment, where >ff is an indicator of the correlation between the 
segments. For example, a perfect sine wave would produce a P which would be 
close to unity, since the correlation between the current segments and the 
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previous reconstructed segments should be almost a perfect match so the LTP 
gain factor is one. 

The LTP gain factor is quantized from a LTP Gain Table. This table is 
characterized in Table 14. 



TABLE 14: LTP Gain Quantization 

10 0.1 0.3 0.5 0.7 0.9 

<.... I |._... I |...„. I i_„.. I |..„.. I i„_> 

P=0 J9=l J3=2 J9=3 P=A p=5 

15 

The gain value of P is then selected from this table depending upon 
which zone or range P^egmem was found, as depicted in Table 14. For example, if 
Psegment ©q^al to 0.45, thcn P is selected to be 2. This technique quantizes the P 
■ J into a 3 -bit quantity. 

i rj20 Next, the LTP (Long-Term Predictor) filter function 1213 is computed. 

"'I The pitch value computed above is used to perform the long-term analysis 

1=0 filtering to create an error signal e(n). The normalized error signals will be 

i transmitted to the other site as an indicator of the original signal on a per sample 

basis. The filter function for the current segment is as follows: 

£ e{n) = Sin) -p^S'{n- pitch) 

l^^5 where w = 0, 1, ... 39 

Next, the code book search and vector quantization function 1215 is 
performed. First, the voice control DSP circuit 306 computes the maximum 
sample value in the segment with the formula: 

GAIN = 

where « = 0, 1, ... 39 

30 This gain is different than the LTP gain. This gain is the maximum 

amplitude in the segment. This gain is quantized using the GAIN table described 
in the DSP Source Code attached in the microfiche appendix. Next, the voice 
control DSP circuit 306 normalizes the LTP filtered speech by the quantized 
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GAIN value by using the maximum error signal | e(n) | (absolute value for e(n)) 
for the current segment and dividing this into every sample in the segment to 
normalize the samples across the entire segment. Thus, the e(n) values are all 
normalized to have values between zero and one, using the following: 



Each segment of 40 samples is comprised of four subsegments of 10 
samples each. The voice control DSP circuit 306 quantizes 10 samples of e(n) 
with an index into the code book. The code book consists of 256 entries (256 
addresses) with each code book entry consisting of 10 sample values. Every 
entry of 10 samples in the code book is compared to the 10 samples of each 
subsegment. Thus, for each subsegment, the code book address or index is 
chosen based on a best match between the 10-sample subsegment and the closest 
10-sample code book entry. The index chosen has the least difference, according 
to the following minimization formula: 



This comparison to find the best match between the subsegment and the 
code book entries is computationally intensive. A brute force comparison may 
exceed the available machine cycles if real-time processing is to be 
accomplished. Thus, some shorthand processing approaches are taken to reduce 
the computations required to find the best fit. The above formxxla can be 
computed in a shorthand fashion by pre-computing and storing some of the 
values of this equation. For example, by expanding out the above formula, some 
of the unnecessary terms may be removed and some fixed terms may be pre- 
computed: 



e{n) = e{n)/GAIN w = 0 ... 39 




where = the input vector of 10 samples, and 
= the code book vector of 10 samples 



= (V-2x,x-+3^,') 
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where is a constant so it may be dropped from the formula, and where the 
value of 14 yf niay be pre-computed and stored as the eleventh value in the 
code book so that the only real-time computation involved is the foUovsdng 
formula: 



Thus, for a segment of 40 samples, we will transmit 4 code book indexes 
corresponding to 4 subsegments of 10 samples each. After the appropriate index 
into the code book is chosen, the LTP filtered speech samples are replaced with 
the code book samples. These samples are then multiplied by the quantized 
GAIN in block 1217. 

Next, the inverse of the LTP filter function is computed at 1219: 

e{n) = e{n) + p*S'{n- pitch) « = 0, . . . , 39 
S'{i) = Sin) « = 40, ... 120; / = 0, ... (120-40) 
S*{i) = e{i) / = 0, ...40 

The voice is reconstructed at the receiving end of the voice over data link 
according to the reverse of the compression algorithm as shown as the 
decompression algorithm in Figure 13. The synthesis of Figure 13 is also 
performed in the compression algorithm of Figure 12 since the past segment 
must be synthesized to predict the gain and pitch of the current segment. 



The use of the speaker 304 and the microphone 303 necessitates the use 
of an acoustical echo cancellation algorithm to prevent feedback from destroying 
the voice signals. In addition, a line echo cancellation algorithm is needed no 
matter which telephone interface 301, 302, or 303/304 is used. The echo 
cancellation algorithm used is an adaptive echo canceller which operates in any 




Echo Cancellation Algorithm 



of the modes of operation of the present system whenever the telephone interface 
is operational, hi particular, the echo canceller is operational in a straight 
telephone connection and it is operational in the voice over data mode of 
operation. 

hi the case of a straight telephone voice connection between the 
telephone interface 301, 302, 303/304, and the telephone line interface 309 in 
communication with an analog telephone on the other end, the digitized PCM 
voice data from digital telephone CODEC 305 is transferred through the voice 
control DSP/CODEC circuit 306 where it is processed in the digital domain and 
converted back from a digital form to an analog form by the intemal linear 
CODEC of voice control DSP/CODEC cu-cuit 306. Smce digital telephone 
CODEC ckcuit 305 is a ^i-law CODEC and the mtemal CODEC to the voice 
control DSP/CODEC circuit 306 is a linear CODEC, a ^i-law to Imear 
conversion must be accomplished by the voice control DSP/CODEC circuit 306. 

hi addition, the sampling rate of digital telephone CODEC 305 is slightly 
less than the samplmg rate of the linear CODEC of voice control DSP/CODEC 
circuit 306, so a slight sampling conversion must also be accomplished. The 
sampling rate of digital telephone ja-law CODEC 305 is 8000 samples per 
second, and the sampling rate of the linear CODEC of voice control 
DSP/CODEC ckcuit 306 is 8192 samples per second. 

Referring to Figure 14, in conjimction with Figure 3, the speech or analog 
voice signal is received through the telephone interface 301, 302, or 303 and is 
digitized by the digital telephone CODEC circuit 305 in an analog-to-digital 
conversion 1401. The CODEC for circuit 305 is a companding |a-law CODEC. 
The analog voice signal from the telephone interface is band-limited to about 
3,500 Hz and sampled at 8 kHz with each sample encoded into 8-bit PCM data 
producing a serial 64kb/s signal. The digitized samples are passed to the voice 
control DSP of circuit 306 where they are immediately converted to 13 -bit linear 
PCM samples. 

Referring again to Figure 14, the PCM digital voice data y(n) from 
telephone CODEC circuit 305 is passed to the voice control DSP/CODEC circuit 
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306 where the echo estimate signal y(n) in the form of digital data is subtracted 
from it. The subtraction is done on each sample on a per sample basis. 

Block 1405 and 1421 are gain control blocks and g^, respectively. 
These digital gain controls are derived from tables for which the gain of the 
5 signal may be set to different levels depending upon the desired level for the 

voice signal. These gain levels can be set by the user through the level controls 
in the software, as shown in Figure 49. The gain on the digitized signal is set by 
multiplying a constant to each of the linear PCM samples. 

In an altemate embodiment, the gain control blocks and g^ may be 
10 controlled by sensing the level of the speaker's voice and adjusting the gain 

accordingly. This automatic gain control facilitates the operation of the silence 
detection described above to assist in the time allocation between multiplexed 
O data and voice in the voice over data mode of operation, 

Q In voice over data mode, the output of gain control block g^ is placed in a 

•;il 5 buffer for the voice compression/decompression algorithm 1425 instead of 
J sample rate converter 1407. The samples in this mode are accumulated, as 

i;Q described above, and compressed for multiplexing and transmission by the main 

controller 313. Also in voice over data mode, the gain control block 1421 
receives decompressed samples from the voice compression/decompression 

^ jo algorithm 1425 instead of sample rate converter 1423 for output. 

J f The echo canceller of Figure 14 uses a least mean square (LMS) method 

of adaptive echo cancellation. The echo estimate signal subtracted from the 
incoming signal at 1403 is determined by function 141 1. Function 141 1 is a FIR 
(finite impulse response) filter having in the preferred embodiment an impulse 
25 response which is approximately the length of delay through the acoustic path. 

The coefficients of the FIR filter are modeled and tailored after the acoustic echo 
path of the echo, taking into accovint the specific physical attributes of the box 
that the speaker 304 and microphone 303 are located in and the proximity of the 
speaker 304 to the microphone 303. Thus, any signal placed on to the speaker is 
30 sent through the echo cancellation fimction 1411 to be subtracted from the 

signals received by the microphone 303 after an appropriate delay to match the 
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delay in the acoustic path. The formula for echo repUcation of function box 
1411 is: 



1=0 



and the result of the subtraction of the echo cancellation signal f(n) from the 
microphone signal is: 

ein) = y(n)-y(n). 

The LMS coefficient function 1413 provides adaptive echo cancellation 
coefficients for the FIR filter of 141 1 . The signal is adjusted based on the 
following formula: 



J3* e{n) 

h, (w -f 1) = h,{n) + ^ x{n - i) 

where / = 0, ... A^-1 
N= # of TAPS 
n = Time Index 

A: = 1000 



The echo cancellation of functions 1415 and 1417 are identical to the 
functions of 1413 and 1411, respectively. The functions 1407 and 1423 of 
Figure 14 are sample rate conversions, as described above, due to the different 
sampling rates of the digital telephone CODEC circuit 305 and the voice control 
CODEC of circuit 306. 

Voice Over Data Packet Protocol 

As described above, the present system can transmit voice data and 
conventional data concurrently by using time multiplex technology. The 
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digitized voice data, called V-data, carries the speech information. The 
conventional data is referred to as C-data. The V-data and C-data multiplex 
transmission is achieved in two modes at two levels: the transmit and receive 
modes, and data service level and multiplex control level. This operation is 
5 shown diagrammatically in Figure 15. 

In transmit mode, the main controller circuit 313 of Figure 3 operates in 
the data service level 1505 to collect and buffer data from both the personal 
computer 10 (through the RS232 port interface 315) and the voice control DSP 
306. In multiplex control level 1515, the main controller circuit 313 multiplexes 
10 the data and transmits that data out over the phone line 1 523. In the receive 
mode, the main controller circuit 313 operates in the multiplex control level 
1515 to de-mxxltiplex the V-data packets and the C-data packets, and then 
□ operates in the data service level 1505 to deliver the appropriate data packets to 

J the correct destination: the personal computer 10 for the C-data packets, or the 

^15 voice control DSP circuit 306 for the V-data. 

J: 5 Transmit Mode 

- „ In transmit mode, there are two data buffers — the V-data buffer 1511 

i: ^ and the C-data buffer 1513 — implemented in the main controller RAM 316 and 

' 20 maintained by main controller 313. When the voice control DSP circuit 306 
i =J engages voice operation, it will send a block of V-data every 20ms to the main 

controller circuit 313 through dual port RAM circuit 308. Each V-data block has 
one sign byte as a header and 23 bytes of V-data, as described in Table 15 below. 
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TABLE 15: Compressed Voice Packet Structure 
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Where P„ = pitch (7 bits) where n = subframe nvimber 

= Beta (3 bits) 

G„ = Gain (4 bits) 

Vd = Voice data (4x8 bits) 
Effective Bit Rate = 184 bits/20 msec = 9200 bps 
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The sign byte header is transferred every frame from the voice control 
35 DSP to the controller 313. The sign byte header contains the sign byte which 
identifies the contents of the voice packet. The sign byte is defined as follows: 

00 hex = the following V-data contains silent sound 

01 hex = the following V-data contains speech information 
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If the main controller 3 13 is in transmit mode for V-data/C-data 
multiplexing, the main controller circuit 313 operates at the data service level to 
perform the following tests. When the voice control DSP circuit 306 starts to 
send the 23 -byte V-data packet through the dual port RAM to the main controller 
circuit 313, the main controller will check the V-data bxrffer to see if the buffer 
has room for 23 bytes. If there is sufficient room in the V-data buffer, the main 
controller will check the sign byte in the header preceding the V-data packet. If 
the sign byte is equal to one (indicating voice information in the packet), the 
main controller circuit 313 will put the following 23 bytes of V-data into the V- 
data buffer and clear the silence counter to zero. Then, the main controller 313 
sets a flag to request that the V-data be sent by the main controller at the 
multiplex control level. 

If the sign byte is equal to zero (indicating silence in the V-data packet), 
the main controller circxiit 313 will increase the silence counter by 1 and check if 
the silence coimter has reached 5. When the silence counter reaches 5, the main 
controller circuit 313 will not put the foUowdng 23 bytes of V-data into the V- 
data buffer and vsdll stop increasing the silence counter. By this method, the 
main controller circuit 313 operating at the service level will only provide non- 
silence V-data to the multiplex control level, while discarding silence V-data 
packets and preventing the V-data buffer from being overwritten. 

The operation of the main controller circuit 3 13 in the multiplex control 
level is to multiplex the V-data and C-data packets and transmit then through the 
same channel. At this control level, both types of data packets are transmitted by 
the HDLC protocol in which data is transmitted in synchronous mode and 
checked by CRC error checking. If a V-data packet is received at the remote end 
with a bad CRC, it is discarded, since 100% accuracy of the voice channel is not 
ensured. If the V-data packets were resent in the event of corruption, the real- 
time quality of the voice transmission would be lost. In addition, the C-data is 
transmitted following a modem data communication protocol, such as CCITT 
V.42. 

In order to identify the V-data block to assist the main controller circuit 
313 to multiplex the packets for transmission at this level and to assist the 
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remote site in recognizing and de-multiplexing the data packets, a V-data block 
is defined, which includes a maximum of five V-data packets. The V-data block 
size and the maximum number of blocks are defined as follows: 



The V-data block has higher priority to be transmitted than C-data to 
ensure the integrity of the real-time voice transmission. Therefore, the main 
controller circuit 313 will check the V-data bufifer first to determine whether it 
will transmit V-data or C-data blocks. If V-data buffer has V-data of more than 
69 bytes, a transmit block coxmter is set to 5 and the main controller circuit 313 
starts to transmit V-data fi-om the V-data buffer through the data pump circuit 
311 onto the telephone line. Since the transmit block coxmter indicates 5 blocks 
of V-data will be transmitted in a continuous stream, the transmission will stop 
either at finish of the 115 bytes of V-data, or if the V-data buffer is empty. If V- 
data buffer has V-data with a number more than 23 bytes, the transmit block 
counter is set to 1 and starts transmitting V-data. This means that the main 
controller circuit will only transmit one block of V-data. If the V-data buffer has 
V-data with less than 23 bytes, the main controller circuit services the 
transmission of C-data. 

During the transmission of a C-data block, the V-data bufifer condition is 
checked before transmitting the first C-data byte. If the V-data buffer contains 
more than one V-data packet, the current transmission of the C-data block will 
be terminated in order to handle the V-data. 



On the receiving end of the telephone line, the main controller circuit 313 
operates at the multiplex control level to de-multiplex received data to V-data 
and C-data. The type of block can be identified by checking the first byte of the 
incoming data blocks. Before receiving a block of V-data, the main controller 



The V-data block header = 80h; 
The V-data block size = 23; 



The maximimi V-data block size = 5; 



Receive Mode 



57 



circuit 313 will initialize a receive V-data byte counter, a back-up pointer, and a 
temporary V-data buffer pointer. The value of the receiver V-data byte counter 
is 23, the value of the receive block counter is 0, and the back-up pointer is set to 
the same value as the V-data receive buffer pointer. If the received byte is not 
5 equal to 80 hex (80h indicating a V-data packet), the receive operation will 

follow the current modem protocol since the data block must contain C-data. If 
the received byte is eqxial to 80h, the main controller circuit 313 operating in 
receive mode will process the V-data. For a V-data block received, when a byte 
of V-data is received, the byte of V-data is put into the V-data receive buffer, the 
10 temporary buffer pointer is increased by 1 , and the receive V-data counter is 

decreased by 1 . If the V-data counter is down to zero, the value of the temporary 
V-data buffer pointer is copied into the back-up pointer buffer. The value of the 
O total V-data counter is added v^th 23, and the receive V-data coimter is reset to 

SiS. 

^4 23. The value of the receive block coxmter is increased by 1 . A flag to request 

^15 service of V-data is then set. If the receive block counter has reached 5, the main 
" J controller circuit 313 will not put the incoming V-data into the V-data receive 

rrj buffer but throw it away. If the total V-data counter has reached its maximum 

value, the receiver will not put the incoming V-data into the V-data receive 
buffer but throw it away. 
^ |0 At the end of the block, which is indicated by receipt of the CRC check 

! J bytes, the main controller circuit 313 operating in the multiplex control level will 

not check the result of the CRC but instead v^U check the value of the receive V- 
data counter. If the value is zero, the check is finished; otherwise, the value of 
the back-up pointer is copied back into the current V-data buffer pointer. By this 
25 method, the receiver is insured to de-multiplex the V-data from the receiving 
channel 23 bytes at a time. The main controller circuit 313 operating at the 
service level in the receive mode will monitor the flag of request service of V- 
data. If the flag is set, the main controller circuit 313 will get the V-data fi-om 
the V-data buffer and transmit it to the voice control DSP circuit 306 at a rate of 
30 23 bytes at a time. After sending a block of V-data, it decreases 23 fi"om the 
value in the total V-data counter. 
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User Interface Description 

The hardware components of the present system are designed to be 
controlled by an extemal computing device, such as a personal computer. As 
described above, the hardware components of the present system may be 
controlled through the use of special packets transferred over the serial line 
interface between the hardware components and the personal computer. Those 
skilled in the art will readily recognize that the hardware components of the 
present systems may be practiced independent of the software components of the 
present systems and that the preferred software description described below is 
not to be taken in a limiting sense. 

The combination of the software components and hardware components 
described in the present patent application may conveniently be referred to as a 
Personal Commimications System (PCS). The present system provides for the 
following functions: 

1 . The control and hands-off operation of a telephone with a built-in 
speaker and microphone, 

2. Allowing the user to create outgoing voice mail messages with a 
voice editor, and logging incoming voice mail messages with a time and date 
stamp. 

3. Creating queues for outgoing faxes, including providing the 
ability for a user to send faxes from unaware applications through a print 
command; also allowing the user to receive faxes and log incoming faxes with a 
time and date stamp. 

4. Allowing a user to create multi-media messages with the message 
composer. The message can contain text, graphics, pictures, and sound 
segments. A queue is created for the outgoing multi-media messages, and any 
incoming multi-media messages are logged with a time and date stamp. 

5. Providing a way for a user to have a simultaneous data and voice 
connection over a single communication line. 

6. Providing terminal emulation by invoking an extemal terminal 
emulation program. 
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7. Providing address book databases for all outbound calls and 
queues for the telephone, voice mail, fax manager, multi-media mail, and show- 
and-tell functions. A user may also search through the database using a dynamic 
pruning algorithm keyed on order insensitive matches. 

Figure 16 shows the components of a computer system that may be used 
with the PCS. The computer includes a keyboard 101 by which a user may input 
data iato a system, a computer chassis 103 which holds electrical components 
and peripherals, a screen display 105 by which information is displayed to the 
user, and a pointiQg device 107, typically a mouse, with the system components 
logically connected to each other via intemal system bus within the computer. 
The PCS software runs on a central processing unit 109 within the computer. 

Figiu-e 17 reveals the high-level structure of the PCS software. A main 
menu function 1 1 1 is used to select the following subfunctions: set-up 113, 
telephone 115, voice mail 1 17, fax manager 119, multi-media mail 121, show- 
and-tell 123, terminal 125, and address book 127. 

The preferred embodiment of the present system currently runs under 
Microsoft Windows® software running on an IBM® personal computer or 
compatible. However, it will be recognized that other implementations of the 
present inventions are possible on other computer systems and windowing 
software without loss of scope or generality. 

Figure 1 8 describes the control structure of the main menu 1 1 1 in greater 
detail. A timer 131 sends a timing signal to a control block 129 in order to make 
the control block 129 active substantially once every 10 seconds. It will be 
recognized that other timing intervals may be used, as appropriate to the 
windowing system being used, without loss of generality. A status 133 is used to 
preclude other applications or program blocks from taking control of a 
communications port by indicating that the port is currently being used by the 
PCS. The controller 129 looks at all outbovind queues in voice mail 1 17, fax 
manager 119, and multi-media mail 121, and if there is an outgoing message in 
one of the outboimd queues, initiates a dispatch. A signal is then sent to the 
status box in order to preclude other applications or program blocks from using 
the serial communications port. 
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The control block 129 also monitors incoming calls and invokes the 
appropriate program block, either voice mail 117, fax manager 119, multi-media 
mail 121, or show-and-tell 123, in order to further process the incoming call. 
Additionally, the control block 129 is used to invoke telephone functions 115, 
5 terminal emulation functions 125, and allow users to edit the database of 

addresses with the address book function 127. The control block 129 further 
provides for the initialization of PCS parameters via the set-up function 113. 
The main menu, as it is displayed to the user, is shovra in Figure 2. 

Figure 19 illustrates the structure of control block 129. Upon selecting 
10 the set-up function 1 13, the user has access to initialization functions 135 which 
include serial port, answer mode, hold call, voice mail, PBX, fax, multi-media 
mail, and show-and-tell initializations. Upon selecting telephone 115, the user 
tJ has access to telephone functions 137 which include equipment select, volume 

^ J control, and call functions, as shown in the screen display of Figure 49. Upon 

J; 4 5 selecting voice mail 117, voice mail functions 139 are provided which include a 
voice editor, voice messages to be sent, and voice messages received, as shown 
|:ri in the screen display of Figure 50. Upon selecting fax manager 119, fax manager 

functions 141 are provided which include set-up functions, faxes to be sent, and 

; faxes to be received, as shown in the screen display of Figure 52. If multi-media 

f [j 

,,20 mail 121 is selected, multi-media mail functions 143 are provided which include 
J 2 the set-up function, multi-media messages to be sent, and multi-media messages 

received functions, as illustrated by the screen display shown in Figure 53. If 
show-and-tell 123 is selected, show-and-tell functions 145 are provided to the 
user which include open, select new, and help, as illustrated in the screen display 
25 of Figure 54. If the terminal function 125 is selected by the user, the terminal 

emulation function 147 is provided to the user via a terminal emulation block. If 
address book 127 is selected, address book functions 149 are provided to the user 
which include file functions, edit functions, and help functions, as shown in the 
screen display of Figure 55. 
30 The set-up functions 135 are accessed by an initialization menu, as 

shown in the screen display of Figure 40. The PCS software provides support 
for any communications port that is contained within the personal computer. 
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Figure 41 shows the screen display shown to a xiser to enable a user to select a 
specific communications port to be used by the PCS software. A user may also 
specify what action is to be taken by the PCS software when an incoming call 
arrives. The screen display of Figure 42 is shown to the user, while Figure 20 
describes the fixll control of the answer mode set-up initialization procedure. 
Upon selecting answer mode set-up 151, the user is presented with eight answer 
mode choices 153 and described as follows: 

1 . PCS does not answer. The PCS software does not answer an 
incoming call and the telephone equipment acts as normal. 

2. Voice mail. The PCS software answers the incoming call and 
acts as an answering machine to record messages. 

3. Fax. The PCS software answers the incoming calls and acts as a 
fax machine. 

4. Multi-media mail. The PCS software answers the incoming call 
and receives multi-media mail that is being sent by a remote caller. 

5. Show-and-tell. The PCS software enables simultaneous data and 
voice commimication using the same communication line. 

6. Terminal. The PCS provides terminal emulation through a 
terminal emulation block, or optionally transfers control to a third-party terminal 
emulation program. 

7. Automatic. The incoming call is analyzed and the appropriate 
mode is automatically entered. 

The user may additionally enter a numeric value which represents the 
number of rings to wait before the PCS software answers an incoming call. 

If, fi^om the set-up functions 135, the hold call fimction is selected, the 
hold call display, as illustrated in Figure 43, is shown to the user, who may then 
enter a numeric value to specify the number of hours all outgoing calls are to be 
held. If, at set-up functions 135, the voice mail set-up option is selected, the 
screen display, as illustrated in Figure 44, is displayed to the user, who may then 
enter a file name to be used as a greeting file for all incoming calls. If, at set-up 
functions 135, the PBX set-up function is selected, the screen display of Figure 
45 is shown to the user, who may then enter a dialing prefix to be applied to any 
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outgoing telephone number. This provides for an easy way to use a list of 
telephone numbers with an in-house PBX system without making modifications 
to the telephone number list. If, at set-up functions 135, the fax set-up function 
is selected, the screen display of Figure 46 is displayed to the user, who may then 
5 enter a numeric value which represents the number of times to attempt to send a 
fax before timing out. If, at set-up functions 135, the multi-media mail set-up 
function is selected, the display of Figure 47 is shown to the user, who may then 
enter the name of a file to be used for user information. If, at set-up functions 
135, the show-and-tell function is selected, the screen display of Figure 48 is 
10 shown to the user, who may then enter show-and-tell user information. 

Figure 49 shows the telephone control function display as shown to the 
user. Figure 21 further illustrates the steps and options 155 available when the 
^ 3 telephone control function 1 15 is selected. The user may use the mouse to select 

■ J between a speaker phone, a handset, or a headset to be used with the 

;;]l5 communications device, and may adjust the volume of the speaker with a first 

logical slider switch, and the gain of the microphone with a second logical slider 
i;0 switch (both slider switches being displayed on the screen). During a call, the 

user may select to save a telephone number, redial a telephone number, record a 
call, flash between calls, mute a call, or place a call on hold. 

I if 

:,20 Figure 22 shows the voice mail functions 157 that are available upon 

selecting voice mail 117. A user may set-up a voice mail greeting file or may 
edit a voice mail message by selecting the voice mail editor, as shown in Figure 
50. The PCS software provides for two voice mail queues: the first, for voice 
mail messages to be sent, and the second, for voice mail messages received. In 

25 the send queue, a user may add messages to the queue, listen to messages in the 
queue, delete messages fi-om the queue, or refi-esh the send queue display. With 
the receive queue, a user may listen to messages in the queue, store messages in 
the queue, delete messages from the queue, forward messages from the queue to 
another voice mail user, or refi*esh the queue. The voice mail editor, as shown to 

30 the user in Figure 51, allows the user to select file functions to open or save a 
voice mail message; edit functions for editing a voice mail message; playing, 
recording, and pausing a voice mail message; and adjusting the play volume and 
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the record volume, llie user may also optionally select between a speaker 
phone, headset, and handset. 

Figure 52 illustrates the fax manager display as shown to the user. Figure 
23 illustrates the fax manager functions 159 that are available to a user after 
5 selecting the fax manager function 119 from the main menu 111. The fax 

manager function provides for two queues: the first, for faxes to be sent; and the 
second, for faxes that are received. When reviewing the first queue of faxes to 
be sent, the user may preview a fax, print a fax, delete a fax from the queue, 
refresh the send fax queue, or forward a fax to another user. When reviewing the 
10 second queue of received faxes, the user may view a fax, print a fax, delete a fax, 
refresh the received fax queue, forward a fax to another user, or store a fax. 

Figure 53 describes the multi-media mail display that is shown to the 

0 user. Figure 24 describes the multi-media mail functions 161 that are available 
: J to a user upon selecting the multi-media mail function 121 . Upon selecting 
l'^5 multi-media mail, the PCS software provides for set-up, composing a message 

with the message composer, or allowing the user to view and edit multi-media 
i;ri messages in two queues: the first queue, having multi-media messages to be 

sent; and the second queue, having multi-media messages that have been 

1 received. When reviewing the first queue of send messages, a user may add 
,,£0 messages to the queue, preview messages in the queue, delete messages from the 
: queue, or change attributes of messages in the queue. When a user is accessing 

the second queue of multi-media messages that have been received, the user may 
view messages, store messages, delete messages, forward messages to another 
user, or refresh the queue. 

25 Figure 54 illustrates the display shown to the user upon selecting the 

show-and-tell function 123 from the main menu 111. Figure 55 illustrates the 
display that is shown to the user upon selecting the address book function 127 
from the main menu 1 1 1 . A user may open a previously stored address book file 
or may edit an existing address book file by adding, deleting, or changing entries 

30 that are in the file. Additionally, the PCS software provides for a user to search 
through the database by using a dynamic pruning algorithm keyed on order 
insensitive matches. As the user enters a search string at a dialogue box, the list 
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of matches displayed is automatically updated in real-time to correspond to as 
much of the search string that has already been entered. The list continues to be 
updated until the search string has been completely entered. 

Software Control Description 

The preferred embodiment of the software control system of the present 
invention runs under Microsoft Windows software on an IBM PC or compatible. 
It will be recognized that other software implementations are available on other 
types of computers and windowing systems without loss of generality. 

Figure 25 shows the timing loop 131 of Figure 18 in greater detail. In 
order to process pending actions, the timer 131 checks the fax out queue at 163, 
the multi-media out queue at 165, the voice mail out queue at 167, and the 
communications port at 169. The three output queues and the commimications 
port are checked substantially once every 10 seconds to determine if there are 
any pending actions to be performed. If the timer does find a pending action in 
one of the queues, or if there is information coming in from the communications 
port, a secondary timer of duration 100 milliseconds is spawned in order to 
handle each pending action. This polling of the output queues continues as long 
as the main PCS software is active and running. The polling interval rate of 
substantially 10 seconds is short enough such that there is no significant time 
delay in handling any pending action in any of the output queues. It will be 
recognized that short timing intervals other than substantially 10 seconds may be 
used without loss of generality. 

Figure 38 illustrates the flow control for the outgoing queue timer. The 
incoming timer is stopped at 3701 and the communications port is initialized at 
3703. A job record 3707 is used to determine the destination for this message 
and dial the telephone at 3705. After a protocol handshake at 3709, an output 
page is sent at 371 1, which may include a fax code file 3713. The remaining 
number of pages to be sent is checked at 371 5, and if there are more pages, 
control retums to 3709 so that the additional page or pages can be sent to the 
destination. Otherwise, if at 3715 there are no more pages to be sent, the job 
record 3707 is updated at 3717, and the incoming time is restarted at 3719. 
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Figure 39 illustrates the flow control for the incoming queue timer. At 
3801, the communications port is initialized and the software waits for a ring 
indicator from an incoming call at 3803. After receiving a ring, the software 
answers, establishes a connection via a protocol handshake at 3805, and receives 
5 input data at 3807. If at 3809 there is more data to be received, control passes 
back to 3805 so that the pending data may be received. Otherwise, if at 3809 
there is no more data to be received, the call is terminated at 381 1. 

Figure 26 shows the control software used for a hands-ofif telephone. The 
telephone control software is invoked upon selection of the telephone option 
10 115, shown in Figure 2. Upon selection of the telephone option at 2501, any 

timers that have been spawned or are currently miming are disabled, and at 2503, 
the commimications port for the personal computer is initialized. At 2505, the 

Q telephone control software handles any options that have been selected by the 

user as displayed to the user by Figure 49. At 2507, the user may logically select 

\J\5 between a headset, handset, or speaker phone; at 2509, the user may adjust the 

Sj volimie level of the speaker or the gain of the microphone; and at 25 1 1 , the user 

may select between mute, hold, record, and redial fimctions for the hands-off 
telephone. After the user selects options at 25 13, a telephone number is dialed 

|,=i= and the call initiated. After the call is complete, at 2515 the user hangs up, and 

^ 20 at 2517 any timers that had been stopped at 2503 are restarted and at 2519 the 

I --^ communications port is restored to its previous state. 

Figure 27 shows the voice mail control software that is invoked by option 
117 from Figure 2. The user may select either a new file or record at 2601 , open 
an existing file at 2613, or abort at 2625. Upon selecting a file or record at 2603, 

25 the file may be saved at 261 1 , or the user may select options at 2605. The 
selectable options include setting the volume or record levels at 2607, or 
selecting between a handset, a headset, or a speaker phone at 2609, as shown to 
the user by the voice mail editor display given in Figure 51. If an existing file is 
opened at 2613, the file name is selected at 261 5, whereupon a user may then 

30 play the file at 261 7 or select options at 2619. The selectable options from 2619 
include setting the volume or record levels at 2621 or selecting between a 
headset, handset, or speaker phone at 2623. Once a voice mail message has been 
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recorded or opened from a previous session, a graphical representation of the 
voice mail message is displayed in a window with x and y dimensions, where the 
X dimension represents time and the y dimension represents the volume of the 
voice mail message at that point in time. The pointing device may be used to 
modify the voice message by graphically changing the two-dimensional voice 
message plot. The cursor is placed within the two-dimensional voice message 
plot in order to indicate the portion of the voice message to be modified. A 
scroll button beneath the two-dimensional voice message plot may be used to 
select the time portion of the message to be displayed v^thin the two- 
dimensional plot window. Time is shown increasing form the left to the right, 
corresponding to the x axis of the plot. As the scroll button is moved to the 
right, later portions of the voice message are displayed. As the scroll button is 
moved to the left, earlier portions of the voice mail message are displayed. A 
nimieric value is shown substantially on the left side of the two-dimensional plot 
windows, which is updated continuously and corresponds to the time value of 
the current location on the x axis. 

Upon the recording of a voice mail message, the voice mail may be added 
to the voice mail send queue, as displayed to the user in Figure 50 and described 
at 159 in Figure 23. Upon adding a voice mail message to the voice mail queue, 
the user is prompted, as shown in Figure 56, to enter a name and a telephone 
nimiber to whom the voice mail message must be sent. The user may select from 
a pre-determined list of voice mail recipients previously set up in the address 
book, as shown to the user in Figure 55. 

Figures 28 and 29 show how the fax code drivers of the PCS software 
typically work. The fax capability is tied to the windowing system print 
command so that facsimile transmissions may be sent by any software that can 
print through the windowing environment. At 2701 in Figure 28, a high 
resolution fax driver examines a print file at 2703 that has been printed by a 
windowing system application. The print file is then converted and imaged at 
2705 into a PCX bit-mapped format which is of the correct horizontal and 
vertical resolution in dots per inch (dpi) for high resolution facsimile devices. 
Figure 29 shows an equivalent process used by a low resolution fax driver at 
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2801. A print file 2803 is converted at 2805 to a low resolution PCX bit-mapped 
format for use with low resolution facsimile devices. Upon converting a print 
file to a facsimile document, the facsimile document may be added to the fax out 
queue, as displayed to the user in Figure 52. The user may select the name and 
5 telephone number of a person to send the fax to through the address book 

function, as shown to the user in Figure 55. Received faxes may be viewed or 
printed from the fax in queue. 

Figure 30 shows the multi-media control software that is invoked by a 
user selecting multi-media option 121 shown to a user in Figure 2. The user may 
10 initialize multi-media mail settings at 2901 , select various multi-media mail 

options at 2905, or invoke a message composer at 2903, as shown to the user by 
the screen display of Figure 57. In the message composer 2903, the user may 
□ open a new file at 2907 or cancel the message composer at 291 1 or open an 

s J existing file at 2909. Upon opening an existing file, the file name is selected at 

:; J5 2913, and the multi-media mail editor 2917 is invoked at 2915. While editing a 
''4 file at 2919 a user may select to alternately play a message at 2921 or record a 

m message at 2923. The user may edit in line mode either inserting, deleting, 

joining, or splitting lines; edit in block mode by moving, copying, deleting, or 
highlighting blocks of text; changing the font used to display the text; or 
^ ItO changing the indent and justification attributes of paragraphs of text. Standard 
^2 search features such as forward and backward search and global search and 

replace are also available through an "other" menu. Figure 3 1 fiirther describes 
the options available that are shown to the user by the multi-media edit display 
of Figure 57 (at 3001, file, line edit, block edit, fonts, paragraph, voice, "other," 
25 and help options are available). If at 3001 the user selects "file," the options 
shown in 3003 (save, save as, page lay-out, printer set-up, and print) are 
available to the user. If at 3001 the user selects "voice," the options available at 
3005 (record voice, stop recording, play voice, stop play, store voice to disk, and 
get voice fi-om disk) are available to the user. After selecting from edit controls 
30 3001, at 3007 the appropriate action is taken by the PCS software, the voice icon 
is displayed, and any additional graphics are also displayed. 
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After a multi-media message has been created, at 3 101 the user may add 
a message to the send queue. Upon adding the message, at 3 103 the user selects 
the name and telephone number of a person to send a message to, or at 3 105 
selects a name from the address book and at 3 107 selects a destination from the 
5 address book to send the message to. The message is then added to the job list at 
3109, and, at 31 1 1 the job scroll list is updated. 

Figure 33 shows the options available when a multi-media message has 
been received. At 3201 , the user may select and view the message. At 3203, the 
software selects the appropriate message, opens the job list at 3205, loads the 
10 received message at 3207, and invokes the message composer at 3209 in order to 
display the received multi-media message. 

Figure 34 shows the software control procedure used with the show-and- 
i'g tell feature to provide data over voice capability when selected at 123 from 

Figure 2. At 3301, any existing timers are disabled and the communications port 
^•\5 is initialized at 3303. The destination for any messages is accessed from the 
^ J address book at 3305, whereupon the telephone number is dialed and the 

J g connection is set up at 3307. Upon detecting a successfiil connection, the data 

over voice mode is initialized at 3309 while a message is being transmitted at 
1 - 3311 the user may select options of either quitting PCS at 33 1 5, or invoking a 

' 20 terminal emulation at 33 13. The data over voice connection is accomplished by 
J - multiplexing the bandwidth of the connection, as described elsewhere in this 

specification. 

Figure 35 shows the control procedure used when receiving data over 
voice messages. At 3401, any existing timers are disabled and the 

25 communications port is initialized at 3403. At 3405, the software waits for a 

ring indicator. If, after a pre-determined amount of time, no ringing indicator is 
detected, a time-out occurs at 3407, whereupon the PCS software aborts are 
returns at 3409. Otherwise, at 341 1 a ring indicator is received, the data over 
voice connection is established at 3413, and the user may select options at 3415 

30 either to quit the PCS software at 3419 and close the communications port at 
3421, or invoke terminal emulation at 3417. 
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Figure 36 shows the control procedure used when transmitting voice mail 
messages. At 3501, a voice mail message is added to the send queue, whereupon 
at 3503, the user specifies the message file name and, at 3505, indicates a name 
fi'om a previously entered address book entry 3505 and destination telephone 
number 3507 to send the message to. The message is then added to the job list at 
3509, and the queue display is updated at 351 1. 

Figure 37 shows the control procedure used when receiving voice mail 
messages. At 3601, a voice mail message is received and recorded by the PCS 
software. At 3603, the user selects a message to be reviewed or edited, after 
which the software opens the job list at 3605, loads the selected recorded 
message into memory at 3607, and invokes the voice editor at 3609 with the 
selected recorded message. The user may then select various functions fi"om 
within the voice editor to review or edit the message, as previously described 
above. 



Descriptions of the data structures and variable names and types are 
given below for the preferred embodiment of the present invention. The 
preferred embodiment is written in the C programming language and runs under 
Microsoft Windows software on an IBM PC or compatible system; however, it 
will be recognized that these data structures and methods are generic and 
potentially useful for a wide variety of other windowing software, systems, and 
programming languages. 

Address Book 

Address key types are used to indicate how information within the 
address book should be displayed to the user: 

KEY_NAME 0 list addresses by name 



Data Structures Description 



KEY_AFFILIATION 1 
KEY_STREETADRS 
KEY_CITYSTATE 
KEY ZIP 



by affiliation 



2 
3 
4 



by streetadrs code 
by citystate code 
by zip code 
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KEY_PHONE 
KEY_FAX 
KEY MISC 



5 
6 
7 



by phone code 
by fax code 
by misc code 



KEY TYPE COUNT 



8 



number of key types 



The address entry structure defines what fields are associated with each 
address book entry: 

s_address entry struct 



Key Names[KEY_TYPE_COUNT] = 
"Name", 
"Affiliation", 
"StreetAdrs", 
"CityState", 
"Zip", 
"Phone", 
"Fax", 
"Misc" 

The following character strings are used to hold address book 



unsigned int delFlag ; 
unsigned int caName ; 
unsigned int caAffiliation ; 
unsigned int caStreetAdrs ; 
unsigned int caCityState ; 
unsigned int caZip ; 
unsigned int caPhone ; 
unsigned int caFax ; 
unsigned int caMisc ; 



delete flag 
Name field 
Affiliation field 
Street Address field 
City State field 
Zip field 

Phone Nxmiber field 
Fax Nimiber field 
Miscellaneous field 



information: 



char 

static 

static 

static 

static 

static 

static 



g_adrsbooktemp[MAX_FILE_NAME_LEN] = "temp.adr" ; 

char g_ClipboardFormat[ ] = "CF_FLOCOM" ; 

char g_adrsbookDlgName[ ] = "AdrsBkDlg" ; 

char g_adrsbookFileName[MAX_FILE_NAME_LEN] ; 

char g_FaxStr[ADDRESS_ENTRY_FIELD_SIZE] ; 

char g_PhoneStr[ADDRESS_ENTRY_FIELD_SIZE] ; 

char g_NameStr[ADDRESS_ENTRY_FIELD_SIZE] ; 
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Fax Send and Receive Queues 

A structure definition is used to indicate how information within the fax 
send and receive queues is stored: 



5 struct sJobDesc { 
union { 

unsigned char phoneNum[32]; 
short nbrOflobs; 

}ul; 

10 union { 

short nextJobNum; 
unsigned char date [16]; 

}u2; 

unsigned char title [64]; 
1 5 unsigned char receiver [32] ; 

unsigned char coverFile[32]; 
□ unsigned char telNum[32]; 

0 unsigned char faxCdFile[8]; 

1 ^ unsigned char tinie[8] ; 
5 Jo WORD zState; 

•^^ time_t zTime; 

^ short zPages; 

i;Q short zResult; 

int zTries; 
r25 char zType; 

I ^ char zDummy [ 1 9] ; 

i y } tJobDesc, *tp_jobDesc, far *tpl_jobDesc; 

! static tjobDesc g_curJob, g_bu0ob; 

^lO 

Multi-Media Send and Receive Queues 

The following structure definition is used to indicate how information 
within the multi-media send and receive queues is stored: 



35 struct sjobDesc { 
union { 

unsigned char phoneNum[32]; 
short nbrOfFobs; 

}ul; 

40 union { 

short nextJobNum; 

unsigned char date [16]; 

}u2; 
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unsigned char title[64]; 

unsigned char receiver[32]; 

unsigned char coverFile[32]; 

unsigned char telNum[32]; 
unsigned char MMCdFile[8]; 
unsigned char time[8]; 
WORD zState; 
tinie_t zTime; 
short zPages; 
short zResult; 
int zTries; 
char zType; 
char zDimimy [ 1 9] ; 
} tJobDesc, *tpJobDesc, far *tplJobDesc; 

Show-and-Tell 

The show-and-tell structure definition is the same as that used in the 
address book. The static variables to define field sizes are given below: 



/* 

Static Varibles Used For Address Book Proc. The Variable 



Names Are Same As The Ones hi "Adrsbook", So That Same 

Modules Covdd Be Used 

*/ 

#define ADDRESS_ENTRY_FELD_SIZE 64 
#defineNUM_MEMBERS_ADRS_STRUCT 8 
#define NUM_ADRS_FIELDS 8 
#define VAL_PAUSE 2000 
#define PCKT_COM_TIME 20 
#define WAIT_RING_TIME 120 
#define DIAL_TIME 60 
//1 029 vasanth for delay before %pl command 
#define DELAYTIME 10 



/* Address Key Types */ 
#define KEY_NAME 
*/ 

#define KEY_AFFILIATION 
#define KEY_STREETADRS 
#define KEY_CITYSTATE 
#define KEY_ZIP 
#define KEY_PHONE 
#defme KEY_FAX 
#define KEY_MISC 
#define KEY_TYPE_COUNT 



0 


/* 


list addresses by name 


1 


/* 


by affiliation */ 


2 


/* 


by streetadrs code */ 


3 


/* 


by citystate code */ 


4 


/* 


by zip code */ 


5 


/* 


by phone code */ 


6 


/* 


by fax code */ 


7 


/* 


by misc code */ 


8 


/* 


number of key types */ 
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= 30 



Voice Mail Send and Receive Queues 

The voice mail structure definition and static variables to define field 






sizes are given below: 




5 


#define READCOUNT 


24000 




#define FRAME SIZE 


2000 




#define SCROLL_STEP 


4 




#definePOS IN PAGE 


100 




#define T_POSlf ION 


0.02 


10 


#define COMP FRAME SZ 

Ti VXwXXXXV/ V./V.yXVXX. X ^« 1 V ■ ■ J 


24 




#define BYTES IN FRAME 


12000 




#define BYTE_NUMBER 


6 




#define WAVE COEF 

TTVXWXXLX^ TV V X^ ^^V-^^X^X 


1 




#define FORMAT_STRl 


"%s -> %s # %s 


15 


#define FORMAT_STR2 


"%s -> %s # %s 


3 


#define OUT_JOBS_FILE_NAME 


"jobs" 


"5 


#define IN_JOBS_FILE_NAME 


"jobs" 


io 

n 


#define MAX_JOBS 10 






struct sJobDesc { 






union { 






unsigned char phoneNum[32]; 


25 


short 


nbrOfJobs; 




}ul; 





Schedule: %s: %s" 
Sent: %s: %s" 



union { 

short nextJobNum; 

xinsigned char date[16]; 



}u2; 

unsigned char title [64]; 

unsigned char receiver [3 2]; 
unsigned char coverFile[32]; 
3 5 unsigned char telNum [3 2] ; 

unsigned char faxCdFile[8]; 

xmsigned char time[8]; 

WORD zState; 

time_t zTime; 
40 short zPages; 

short zResult; 

int zTries; 

char zType; 

char zDunimy[19]; 
45 } t_jobDesc, *tp_jobDesc, far *tpl_jobDesc; 



74 



/ -v. 
static HWND hwndVMDlg; 




static int 
static int 
static int 



g_cxWave ; 
g_cy Wave ; 
g_nSamples ; 



/* width of waveform window */ 
/* height of waveform window */ 
/* sample counter */ 



static char g_outVMDir[MAX_FILE_NAME_LEN]; 

static char g_inVMDir|>lAX_FILE_NAME_LEN]; 

static HANDLE g_outJobsHndl = 0; 

static HANDLE g_inJobsHndl = 0; 

static HANDLE gJobFileHndl = 0; 

static tJobDesc g outJobsO; 

static tJobDesc g outJobsn; 

static t_jobDesc g_inJobsO; 

static tJobDesc g_inJobsn; 

static short g_numOfOutJobs =0; 

static short g nxmibOflnJobs =0; 

static int g VMOutlx = -1 ; 

static int g_VMInIx = -1 ; 

static OFSTRUCT jOfStruct ; 

static FILE *vdata ; 

static OFSTRUCT OfStruct ; 

static int hFile ; 

static OFSTRUCT oOfStruct ; 

static int hoFile ; 

static char g_inFileName[MAX_FILE_NAME_LEN] ; 

static int hjFile ; 

static char gJFileName|>lAX_FILE_NAME_LEN] ; 

static char g_uFileName[MAX_FILE_NAME_LEN] ; 

static char g_oFileName[MAX_FILE_NAME__LEN] ; 

static char g_sendListenFile[MAX_FILE_NAME_LEN] ; 

static char g_recListenFile[MAX_FILE_NAME_LEN] ; 



The following are variables to be used to write the playback or record 
level into the pcs.ini volume field names: 



char *g_Play Volume = "Play Volume"; 
char *g_Rec Volume = "Record Volume"; 

Default volmne levels for record and play: 

int g_recPos = 5; 
int g_playPos = 5; 
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char g_MicroVol[10] = ">MVO"; 
recording. 



Microphone level to be used for 



char g_PlayVol[10] = ">SVO"; 



Speaker level to be used for play. 



int g_ofBik = FALSE; 



Off hook flag indicating that either 
record or play is in progress. If flag 
is set only, then send packet 
commands to increase/decrease 
volume level. 



Below are given Static Variables Used For Address Book Proc. The 
Variable Names Are The Same As The Ones In "Adrsbook.c", So That Same 
Modules Could Be Used: 



The present inventions are to be limited only in accordance with the 
scope of the appended claims, since others skilled in the art may devise other 
embodiments still within the limits of the claims. 



The microfiche appendix of U.S. Patent Application No. 08/002,467 filed 
8 January 1993 (issued as U.S. Patent No. 5,452,289), incorporated herein by 
reference, contains the source code for the software running on the personal 
computer and the source code for the software running on the voice control 
DSP/CODEC. 



#define MAX_REC_VOL 
#define MIN_VOL_LEVEL 
#defineMAX PLAY VOL 



13 
0 
9 



#define MAX_ADRS_ENTRIES 
#define ADDRESS_ENTRY_FIELD_SIZE 
#define NUM_MEMBERS_ADRS_STRUCT 
#defineNUM ADRS FIELDS 



512 
64 
8 
8 



Microfiche Appendix 
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